SOCIETAL TAXONOMIES:
Mapping the Social Universe 


Gerhard Lenski 
Abstract


Taxonomies have played a much more important role in the development of 
the sciences than is generally recognized by sociologists. They have provided 
both a foundation for the formulation of basic theory and a spur to innovative 
research. This paper traces the development from ancient times to the present 
of efforts to create a taxonomy of human societies, the most basic and inclusive 
of all social systems. It concludes with a discussion of the question of whether 
there are compelling reasons for preferring any one of the various taxonomies 
that have been devised. Four principles for evaluating taxonomies are proposed 
and applied. 


MAPPING THE SOCIAL UNIVERSE 


Comprehensive taxonomies that map, or define, relationships among phenom- 
ena have been far more important in the development of the various sciences 
than most sociologists recognize. It is hard to imagine, for example, Darwin's 
critical contributions to biological theory without the foundation laid in the 
previous century by Linnaeus and other pioneering taxonomists. And this is 
no isolated example: In geology, the systematic mapping of rock formations
laid the necessary foundation for the modern understanding of plate tectonics,
and molecular biologists are now hard at work mapping the human genome. 
Other taxonomies such as the periodic table, and taxonomies for such varied 
phenomena as subatomic particles and star types, have been no less important 
in the successes and achievements of chemistry, physics, and astronomy. 
Comprehensive taxonomies that are grounded in careful observations, even
when incomplete or incorrect in earlier formulations, provide both a foundation
for the formulation of basic theory (e.g. evolutionary theory in biology)
and a spur to innovative research (e.g. the search for new kinds of particles
and forces in physics). Taxonomies invite the question that is basic to the 
whole scientific enterprise: Why are some entities similar to one another but 
different from others? In short, why are things as they are? 

Efforts to construct a viable taxonomy of the social universe were once an 
important activity of social scientists, though today, unfortunately, that is no 
longer true. This is both surprising and troubling, because as Stephen Jay Gould 
has observed, “taxonomies are theories of order.” In the language of modern 
physics, one can also describe them as efforts to separate meaningful signal 
(i.e. significant patterns) from meaningless noise (i.e. random variation). 

The present analysis surveys the various attempts over the centuries to 
construct a viable taxonomy of human societies, the primary organizational 
subdivisions of the human population, and it gives special attention to the more 
recent abandonment of efforts to construct a comprehensive taxonomy and the 
resultant proliferation of ad hoc, partial taxonomies. The paper concludes by
defining, and then applying, a set of four criteria that may help us to choose 
among the many and varied taxonomies now confronting us. 


PREMODERN SOCIETAL TAXONOMIES 


It may be well to note at the outset that modern scientific taxonomies have 
evolved as an extension of the universal human practice of classifying the 
myriad phenomena encountered in the course of existence—a practice that is 
the basis of language itself and prerequisite to all rational thought and action. 
With respect to the classification of human societies, anthropologists who have 
studied the simplest societies report that members of these groups almost 
always have terms by which they differentiate themselves from other peoples. 
At a slightly higher level of abstraction, the early Greeks made a more encom- 
passing distinction between the entire Greek-speaking population and all other 
peoples, whom they classified as barbarians. 

By the fifth century BC, early Greek philosophers had moved beyond these 
simple, popular formulations and had begun to develop the first analytical 
taxonomies. Although Plato’s analysis of societal constitutions in the eighth 
book of The Republic is the oldest taxonomy of this kind that has survived, it 
is clear from the contents that its author was reacting to ideas and concepts 
propounded and debated by others before him. 

As befitting an Athenian of noble birth, Plato was concerned primarily with 
the political character of societies, the nature of those who ruled, and the nature 
of relations among classes. He identified five types of societal constitutions
that he judged to form a descending scale of virtue, with aristocracies at the 
top and democracies and tyrannies at the bottom. 

Plato’s views and concepts did not go unchallenged for long. Aristotle, the 
greatest of his students, while incorporating some of Plato's ideas, rejected the
view that virtue is linked to size of the ruling group: he argued that there are 
both corrupt and virtuous forms of ruling groups of every size. Thus, oligarchy 
is a corrupt form of aristocracy, just as democracy is a corrupt form of 
constitutional government. For Aristotle, the critical distinction was whether 
a society's polity served the common good or merely the interests of some 
limited group within society. In democracies, the interests of the poor predom- 
inate; in oligarchies, the interests of the wealthy. Aristotle preferred societies 
that promoted the common good. He also favored those that promoted the rule 
of law over those that favored the rule of men, and those that gave power and 
influence to the middle classes, whose interests he saw as more congruent with 
the common good than the interests of either the upper or lower classes. 

In the centuries that followed, advances were made in various aspects of 
social philosophy, but it is hard to find evidence of significant advances in the 
delineation of basic societal formations prior to the eighteenth century. During 
this period, Ibn Khaldun (1332-1406) may well have been the most insightful 
and innovative student of human societies, but his contribution to the devel- 
opment of a useful taxonomy of societies was negligible. 


EARLY MODERN TAXONOMIES: THE EIGHTEENTH 
CENTURY 


By the eighteenth century, social philosophy in western Europe had begun to 
escape from the stifling control of theology established in the medieval period. 
Humanistic explanations of social and cultural phenomena were beginning to 
find expression once more. And, after centuries of neglect, the inductive 
method was again beginning to find favor, thanks to the work of Bacon, 
Galileo, Harvey, Descartes, de Fontenelle, Newton, and others. In brief, a 
foundation was slowly being laid for the development of what we today think 
of as modern science. 

Equally important for the development of the social sciences, the voyages 
of exploration and discovery, which brought Europeans into direct contact for 
the first time with the peoples of Asia, sub-Saharan Africa, and the Americas, 
stimulated some to rethink the Eurocentric view of history that had prevailed 
up to that time. By combining traditional sources of information about the 
early histories of Middle Eastern and European societies with the rapidly 
growing store of information about societies in other parts of the world, the 
foundation was laid for a new, broadly comparative, and evolutionary view of 
human societies. 

Perhaps the first modern writer to develop a broadly comprehensive and 
secular or humanistic taxonomy of societies was a professor of Latin Eloquence 
at the University of Naples, Giambattista Vico (1668-1744). In his Principles 
of a New Science, Vico (1961 [1725], pp. 3 ff and 283 ff) presented a striking
three-stage, developmental taxonomy: societies in (i) the age of gods, (ii) the 
age of heroes, and (iii) the age of men. In the first of these stages, societies 
were theocratic and laws and other social institutions were thought to have 
divine origins. In the second, societies were aristocratic commonwealths and 
ruled by the law of force, though the exercise of force was controlled by 
religious ideas. In the third stage, popular commonwealths prevailed and laws 
were the product of human reason. 

Vico’s developmental, or proto-evolutionary, view of human societies and 
their history became a central element in most of the taxonomies that followed 
in the next two hundred years. In some cases, especially in the writings of 
Saint-Simon and Comte (1830-1842), dependence on Vico extended even to 
the substance of the developmental process itself: for them, as for Vico, human 
history is a progressive movement in ideologies from theism to rationalism—a 
theme that would be echoed later by Max Weber. 

More immediately, however, the proto-evolutionary model found expression 
in the midcentury writings of French intellectuals. Turgot is noteworthy in this 
regard because of the importance he attributed to the influence of material 
factors, most notably subsistence technology, on other societal attributes. Al- 
though Turgot did not develop an explicit taxonomy of societies in his unpub- 
lished manuscript on universal history (1750), the opening pages provide a 
presumed sequence of societal development beginning with societies depen- 
dent on hunting, followed by pastoral societies, and then by agricultural soci- 
eties. It is also interesting to observe in writings of this period references to 
“savage,” “barbarian,” and “civilized” societies as alternative labels for hunting,
pastoral, and agricultural societies. This usage came to be adopted widely
later on, especially in the writings of early anthropologists. 

In the next several decades, this early and more or less implicit taxonomy 
based on the means of subsistence became increasingly explicit and was, with 
various modifications, used as the basis for increasingly sophisticated analyses 
by a growing number of Enlightenment intellectuals. John Millar of Scotland, 
for example, in The Origin of the Distinction of Ranks (1960 [1771]), argued 
that to find the causes of the differences among societies, we must start with 
environmental differences and the differences in modes of subsistence to which 
they give rise (1960, p. 175). He went on to assert that man has both an innate 
"disposition and capacity for improving his condition," and that this "has 
everywhere produced a remarkable uniformity in the several steps of his pro- 
gression.” 

In his analysis, Millar linked such diverse features of societies as settlement 
pattern, type of polity, degree of social inequality, marriage patterns, modes 
of family life, and other matters to the various steps or stages. Throughout,
he alternated in the terminology he used. Sometimes, he wrote of savage,
barbarian, and civilized societies, while at other times he referred to hunters, 
pastoralists, and farmers, but he never left readers in doubt concerning the
equivalence of the two sets of terms. In Millar's writings, as in the writings of 
a number of his contemporaries, one sees the emergence of a striking new 
taxonomy of human societies, based, not on the older constitutional distinctions 
of Plato and Aristotle or on the ideational distinctions of Vico, but on tech- 
noeconomic criteria. 

Adam Ferguson, another influential contributor to the Scottish Enlighten- 
ment, also employed the new technoeconomic taxonomy of societies, but in 
his volume, An Essay on the History of Civil Society (1767), he put greater 
emphasis on the role of property as a derivative, but critical, element differen- 
tiating savage from barbarian societies. Here, Ferguson anticipated, and laid a 
foundation for, a central element in Marx and Engels' analyses in the next 
century. 


NINETEENTH AND EARLY TWENTIETH CENTURY 
TAXONOMIES 


In the nineteenth and early twentieth centuries, both the materialist and idealist 
traditions found vigorous expression in the social sciences. At the same time, 
societal taxonomies gradually became more sophisticated. 

In the early decades of the century, Saint-Simon, drawing on ideas developed
by Vico and others, posited a broad overriding trend in human history from 
anthropomorphically theistic interpretations of the world to scientific natural- 
ism (“Martel,” Int. Encycl. Soc. Sci. 1968, 13:592). His ideas greatly influenced 
Auguste Comte, who became so important in the development of French 
sociology. Comte adopted and expanded Saint-Simon's "law of the three 
stages," making it central in his work, and thus created a radically different 
kind of taxonomy from that developed by Millar, Ferguson, and the eighteenth 
century materialists. For Saint-Simon and Comte, as for Kant and Hegel, 
developments in ideology were assumed to be the critical forces responsible 
for major societal transformations. 

Reacting against the idealist tradition, especially as it found expression in 
Hegel, Marx and Engels developed a militantly materialist theory of societies 
and their development. In The German Ideology, written just four years after 
the publication of the last volume of Comte's Cours de philosophie positive, 
the youthful Marx and Engels asserted in a now famous passage: 


Morality, religion, metaphysics, all the rest of ideology and their corresponding 
forms of consciousness. . . [have no] semblance of independence. They have no 
history, no development; but men, developing their material production and their 
material intercourse, alter, along with this, their real existence, their thinking, and 
the products of their thinking. Life is not determined by consciousness, but con- 
sciousness by life. (Marx & Engels 1959 [1846], p. 247) 

They then went on to state: “The first necessity therefore in any theory of
history is to observe this fundamental fact in all its significance and all its
implications, and to accord it its due importance” (Marx & Engels 1959, p. 249).

Applying this basic principle, Marx and Engels gradually developed over 
many years their own distinctive taxonomy of social and economic formations. 
Employing the mode of production (rather than the means of subsistence used 
by eighteenth century materialists, such as Millar) as their basic criterion, they 
identified in various of their writings (including manuscripts that Marx did not 
intend for publication and which were never published in his lifetime) eight 
or nine distinctive social and economic formations: (i) primitive communism, 
(ii)(a) the Asiatic mode of production, (ii)(b) the classical ancient mode, (ii)(c) 
the Germanic mode, (ii)(d) the Slavonic mode, (iii) feudalism, (iv) capitalism, 
(v) socialism, and (vi) advanced communism. 

Viewed as a whole, the basic criteria underlying the Marxian taxonomy 
were (i) the level of technoeconomic development, and (ii) the presence or 
absence, and the nature, of the property system. With respect to technoecono- 
mic development, the formations constitute a progressive continuum (Marx 
1970 [1859], p. 52), except that the Asiatic, classical ancient, Germanic, and 
Slavonic formations are treated as “alternative routes out of primitive communal
society” (Marx 1964 [1857–1858], p. 32). Beyond this, primitive communism,
socialism, and advanced communism are differentiated from the other
formations by the absence in them of the institution of private property. These 
other formations differ from one another not only in their levels of tech- 
noeconomic development but also in the nature of their property systems and 
in the resulting differences in class structure, class consciousness, and patterns 
of class conflict. Socialist and advanced communist societies, which did not 
yet exist in Marx and Engels' day, but whose emergence they predicted, would 
differ from one another, they asserted, chiefly in terms of levels of social 
consciousness and in the manner in which the social product would be distrib- 
uted within them. In socialist societies, until the new socialist man became the 
norm, material incentives would be required and it would be necessary to 
reward people unequally, but in proportion to the value of their work to society. 
Soon, however, as the corrupting influences on consciousness of the older 
capitalist order disappeared, moral incentives would replace material incen- 
tives, and the social product would be distributed "to each according to his 
need." 

While the ideas of Marx and Engels have had an enormous impact both 
politically and intellectually in the twentieth century, in their own lifetime their 
influence was in no way comparable to that of Herbert Spencer, for whom the 
history of human societies was a continuing process of evolution from an 
incoherent homogeneity to a coherent and coordinated heterogeneity. This 
view led him to a simple, yet distinctive, taxonomy of societies: (i) simple 
societies, (ii) compound societies, (iii) doubly compound societies, and (iv)
trebly (or more) compound societies (1876, Vol. 1, pp. 569-76). Each of these 
taxons was then subdivided on the basis of (a) its mode of governance and (b) 
its permanence of settlement. For example, simple societies were subdivided 
into (a1) those that were headless, (a2) those with occasional headship, (a3)
those with vague and unstable headship, and (a4) those with stable headship. 
These, in turn, were subdivided into (b1) nomadic societies, (b2) semisettled 
societies, and (b3) settled societies. 

Cross-cutting this basic taxonomy, Spencer developed an important distinc- 
tion between (i) militant societies and (ii) industrial societies (1876, Vol. 1, 
pp. 576-96). This distinction was based on differences “between the kinds of 
social activity which predominate” within societies, and “on the resulting 
unlikenesses of organization.” In militant societies, military activities and 
military institutions take precedence over economic activities and economic 
institutions, while in industrial societies the reverse is true. These two kinds 
of societies could exist at any level of evolutionary development from simple 
societies to trebly compound societies. Spencer also believed that not all 
societies fell neatly into these two categories: many occupied an intermediate 
position with neither set of activities or institutions predominating. 

From the standpoint of lasting scientific impact, the most important of the 
new nineteenth century taxonomies may well have been that developed by a 
Danish museum curator, Christian Jürgensen Thomsen. Faced with the task
of organizing a rapidly growing collection of prehistoric artifacts, Thomsen 
hit upon a solution that has become the foundation for virtually all theory and 
theory-oriented research in the important discipline of archaeology. He devel- 
oped a taxonomy based on the kinds of materials used in tools and weapons: 
stone, bronze, and iron. 

Subsequent generations of archaeologists have refined Thomsen’s initial 
formulation so that today one encounters at least eight distinctive taxons: (i) 
Lower Paleolithic, (ii) Middle Paleolithic, (iii) Upper Paleolithic, (iv) Meso- 
lithic, (v) Neolithic, (vi) Chalcolithic, (vii) Bronze, and (viii) Iron societies. 
As with the Marxian taxonomy, the labels apply to historical epochs as well 
as to societies, and they constitute a progressive sequence. 

In contrast to the empirically grounded taxons developed by Thomsen, 
Gemeinschaft and Gesellschaft, creations of Ferdinand Tönnies (1887), were 
intended to be no more than sensitizing ideal types, not to be found in pure 
form in the real world (Heberle 1948, p. 234). Real life institutions and societies 
always combine elements of both types, and the task of the sociologist, ac- 
cording to Tönnies, is to discover to what extent, in what forms, and with what 
consequences each type exists. Gesellschaft arises in response to the exercise 
of Kurwillen, or rational will, which differentiates between means and ends; 
Gemeinschaft is a product of Wesenwillen, or emotions and inclinations. 
Tönnies’ distinction between Gemeinschaft and Gesellschaft has become a
fixture in the sociological tradition. Often, however, the concepts have been 
reified or used as academic jargon while referring to things that are obvious even 
to laymen. Thus, it would be easy to dismiss the distinction as mere pedantry. 

There is, however, more to be said for the distinction and Tönnies’ use of 
it. In contrast to the distinctions developed by many of his predecessors, 
Tönnies' concepts serve as a caution against the all too easy assumption that
change is synonymous with progress, or that it is progressive in all its mani- 
festations. The very word that Tönnies used to refer to the older mode of 
organization, Gemeinschaft, is a word with strong positive connotations. In 
this respect it is quite unlike words, such as “savage,” and “barbarian,” or even 
more connotatively neutral terms, such as “simple” and “compound,” or “Asiatic,”
“ancient,” and “feudal.” In short, Tönnies’ concepts contain an implicit
reminder that even developments that are progressive in many respects come 
at a price and can necessitate costly and unwelcome trade-offs. 

Barely six years after Tönnies published Gemeinschaft und Gesellschaft,
Emile Durkheim published his equally influential volume, De la division du
travail social (1893). It is surprising, therefore, to find no reference to Tönnies’
work in Durkheim’s volume, since both shared a common concern with the 
long-term transformation of societies and conceptualized it in more or less 
dichotomous or polarized terms. 

Although Durkheim has come to be regarded by many as one of sociology’s 
holy trinity, and although De la division du travail social is commonly accepted 
as one of his major works, his distinction between mechanical and organic 
solidarity has not proven very useful from the standpoint of either research or 
theory. Though regularly mentioned in textbooks on the history of sociological 
theory, it is difficult to see that his taxonomy has generated much innovative 
research or led to significant advances in theory. 

The final member of sociology’s honored trinity of macrosocial theorists, 
Max Weber, developed at least two important societal taxonomies. Reacting 
against what he believed to be the overly materialist theories of Marxists and 
others (e.g. Millar, Ferguson), Weber (1920-1921) reasserted the importance 
of belief systems, though in a somewhat different manner from earlier writers 
such as Vico, Saint-Simon, and Comte. In a far-ranging effort to understand 
the forces responsible for the rise of northwest Europe to the unique position 
of wealth and power that it had come to enjoy by the early twentieth century, 
Weber shifted the focus of analysis from the means of subsistence and/or mode 
of production to the religious ideas that differentiated one region from other 
parts of the civilized world. For him, Puritanism, Catholicism, ancient Judaism, 
Confucianism, Buddhism, Hinduism, Islam, and modern secular rationalism 
each had profoundly influenced the societies in which they were dominant, 
and each had given rise to a unique and distinctive social and economic order. 
The other major Weberian taxonomy (Weber 1925 [1968]), based on the
concept of power, took three basic forms: (i) coercive power; (ii) market power; 
and (iii) authority. The latter also took three forms: (iii)(a) charismatic authority,
(iii)(b) traditional authority, and (iii)(c) rational-legal authority; traditional
authority, in turn, could take the form of either: (iii)(b1) patrimonialism or 
(iii)(b2) feudalism. In part, this second taxonomy cross-cut the first, but in part 
the two coincided. Charismatic authority and traditional authority have been 
present at one time or another in all of the older religiously based social 
formations. In contrast, the rational-legal mode is dominant only in modern 
secular societies. 

In addition to the taxonomies he developed, Weber also contributed signif- 
icantly to our modern understanding of social and economic formations 
through his discussions of the concept of ideal types. Here, Weber followed 
Tönnies and anticipated the probabilistic mode of thought, which has devel- 
oped in response to the quantitative revolution in the social sciences and has 
so greatly influenced research and theory in the second half of the twentieth 
century. As he stated in Economy and Society, 


[T]he same historical phenomenon may be in one aspect feudal, in another pat-
rimonial, in another bureaucratic, and in still another charismatic. In order to give
a precise meaning to these terms, it is necessary for the sociologist to formulate
pure ideal types of the corresponding forms of action.... But precisely because
this is true, it is probably seldom if ever that a real phenomenon can be found
which corresponds exactly to one of these ideally constructed pure types. (English
translation, p. 20)


Earlier than most, Weber recognized the dangers of reifying taxonomic 
concepts and the hazards of deterministic formulations of theory. While he 
was not the first to recognize these dangers (see, for example, Marx, Engels, 
Spencer, Tylor), his use of the term “ideal types” focused attention on the
problem in an especially effective manner. 

Although many new and useful taxonomies were developed in the nineteenth 
and early twentieth centuries, the older eighteenth century distinction between 
savage, barbarian, and civilized societies was never abandoned. On the con- 
trary, it continued to be utilized, especially by the founders of the newly 
emerging discipline of anthropology. 

Lewis Henry Morgan, the early American anthropologist, provides a good 
example of this. In his famous volume, Ancient Society (1877), which so 
impressed Marx and Engels, Morgan adopted, but also expanded on, the 
eighteenth century taxonomy. Thus, he subdivided both the savage and bar- 
barian taxons into three separate components, thereby creating a seven part 
developmental or evolutionary taxonomy of both historical epochs and socie- 
ties. 

In addition to expanding the older, less nuanced taxonomy, Morgan’s work 
is noteworthy because of its utilization by several of the pioneers in systematic,
quantitative research. The Dutch sociologist, HJ Niebohr (1900, 1910), ex- 
pressly acknowledged his indebtedness to Morgan’s taxonomy in his important 
early comparative study of slavery. Hobhouse, Wheeler, and Ginsberg (1915), 
in turn, acknowledged their indebtedness to Niebohr in their even more im- 
portant and influential study of the material culture and social institutions of 
preliterate societies. The unambiguous nature of Morgan’s taxons and tax- 
onomies, together with their considerable explanatory power, were almost 
certainly the reasons for their choice by Niebohr, Hobhouse, and other pioneers 
in quantitative macrosociology, and these two characteristics continue to make 
taxonomies based on subsistence technology especially attractive (see Sum- 
mary and Conclusions, below). 


DEVELOPMENTS SINCE WORLD WAR I 


In the period between the two world wars, the most important contributions 
to societal taxonomy were made by V. Gordon Childe, the distinguished British 
archaeologist. In a volume entitled The Dawn of European Civilization (1925),
Childe advanced the thesis that much more was involved in the transition from 
Paleolithic to Neolithic than had previously been supposed. The Neolithic was 
not simply a time when stone tools and weapons became inexplicably smaller 
and more diversified. In Childe’s words, this was the point in history when 
man became the “master of his own food supply through the possession of 
domestic animals and cultivated plants...” (1958 [1925], p. 1). 

In effect, this insight enabled Childe to create a powerful synthesis of the 
archaeological tradition of taxonomies derived from Thomsen, based on ma- 
terials used in constructing tools and weapons, and the anthropological tradi- 
tion of taxonomies, derived from Enlightenment writers, based on the means 
of subsistence. Childe’s synthesis, it should be noted, was not some simple- 
minded or irenical blending of the two traditions, but one that asserted the 
preeminence of the means of subsistence. As he made increasingly clear in 
later writings, notably Man Makes Himself (1936) and What Happened in 
History (1942), the dawn of the Neolithic marked the start of the first great 
revolution in history, because the shift from food collection, or foraging, on 
which human societies had always been dependent in the past, to food pro- 
duction, on which most societies would thereafter depend, was the essential 
precondition for all of the radical changes that followed—growth in produc- 
tivity, growth of population, more permanent settlements, an increased division 
of labor, the emergence of towns and cities, increased trade, the rise of the 
state, growth in the scale of organization, the beginnings of literacy, the growth 
of inequality, the formation of classes, the rise of universal religions, and much 
more. Advances in subsistence technology—not the creation of the institution 
of private property, or the development of compound societies, or the establishment
of a more complex division of labor, or any of the other factors
proposed by other theorists and highlighted by other taxonomies—were the 
critical developments on which the most far-reaching and revolutionary trans- 
formations in human history were based. The developments that other theorists 
emphasized and other taxonomies highlighted were all derivative develop- 
ments. In short, Childe not only provided an invaluable synthesis of two of 
the more important taxonomies, he also formulated a powerfully parsimonious 
and persuasive explanation of the linkages among many of the most important 
structural and cultural components of human societies. 

Childe's striking thesis concerning the revolutionary consequences and 
enormous importance of the shift from foraging to food production that oc- 
curred in the Neolithic has gained increasing acceptance in the social sciences 
in the more than 60 years since he first propounded it. It is now an integral 
part of the dominant theoretical paradigm in archaeology and of one of the 
two leading paradigms in anthropology. Even in sociology, which after years 
of neglect is beginning to recover the broadly comparative and historical 
concerns and perspective of its founders, it has established a foothold. 

Childe also argued that there had been a second important revolution in 
the late prehistoric past—one that he labeled “the urban revolution." This 
revolution built on the foundation laid by the Neolithic revolution. Farming,
unlike hunting and gathering, enabled producers to produce more than was 
required to keep them alive and productive; it made possible a stable, 
storable, and transferable economic surplus. Thus, during the Neolithic, the 
new food producing societies were capable of supporting for the first time 
in history a class of people who could be freed from the necessity of 
providing their own food supply. Gradually, over centuries, a variety of 
methods were devised for separating the surplus from its producers, thereby 
enabling this new segment of society to grow in size, power, and affluence. 
For a variety of reasons, this new class tended to aggregate in certain 
communities, especially the places of residence of religious and political 
elites, thus giving rise to the first truly urban communities (i.e. communities 
whose residents were wholly or largely freed from the necessity of producing 
their own food supply). 

Childe made at least one other important contribution in the field of societal 
taxonomy. In a paper published during the Second World War, Childe (1944)
asserted that the various taxons employed by archaeologists since Thomsen's 
day should be thought of as technological stages, rather than as chronological 
ages. Thus, different societies could be at different stages of development at 
any given time: there was no necessary linkage between the two. This idea 
was more or less implicit in Childe's writings from the outset, but by making 
it explicit he put the focus where it properly belonged, on the key variable in 
major societal transformations rather than on its imperfect chronological
correlates.

One might argue that Childe had, in effect, merely given the old savage/bar- 
barian/civilized taxonomy new labels derived from the archaeological tradi- 
tion. Although there is an element of truth to this, Childe put the taxonomy 
on a much sounder basis; his analysis infused what had often been a lifeless, 
static taxonomy with a powerful and dynamic theoretical perspective, and
Childe’s work also greatly facilitated the exchange of data and ideas between 
the previously largely separated disciplines of anthropology and archaeology. 

In the period following World War II, an American anthropologist, Walter 
Goldschmidt (1959), made a less well known, but important, contribution. 
Drawing on the wealth of anthropological research since Morgan's day,
Goldschmidt developed a taxonomy consisting of six basic types of societies: (i)
nomadic hunting and food gathering societies, (ii) sedentary hunting and food 
gathering societies, (iii) horticultural societies, (iv) herding societies, (v) agri- 
cultural societies, and (vi) industrial societies. 

Two innovations in Goldschmidt's taxonomy have proven especially valu- 
able and have been incorporated into more recent taxonomies based on the 
means of subsistence. First, he differentiated between two types of farming
societies: those that practiced slash-and-burn, or swidden, horticulture, a type
of farming that requires the clearing of new gardens every few years, and those
that practiced plow agriculture, which permits the cultivation of fields on a
more or less permanent basis. Second, he differentiated between preindustrial
agricultural societies and
industrial societies that also employ plow agriculture. 

Goldschmidt's distinction between horticultural and agricultural societies 
paralleled Childe's distinction between neolithic and urban societies. Neolithic 
societies became horticultural societies in Goldschmidt's taxonomy; Childe's 
urban societies became agricultural societies. Goldschmidt's labels, however,
put the focus much more clearly on the technological basis of major societal 
types and their transformations. By labelling the two great revolutions in 
prehistory the “neolithic” and “urban,” Childe shifted the focus from the
underlying technological causes to two of the archeologically more visible 
consequences. In view of Childe's insistence that the social and economic 
formations he had identified were technological stages in societal development, 
Goldschmidt's labels are actually more appropriate than were his own. 

Goldschmidt's taxonomy is also noteworthy because it took account of what 
most now recognize as the third major revolution in human history, the indus- 
trial revolution. To lump contemporary industrial societies together with other 
“civilized” societies, such as Ming China or Antonine Rome, as earlier writers 
had done, was no longer defensible. The new and vastly more complex machine 
technology, powered by enormous quantities of energy derived from coal, 
petroleum, natural gas, and other inanimate sources, has given rise to a
radically new kind of society that is at least as different from agrarian societies of
the recent past as the latter were different from horticultural societies. No 
taxonomy that ignores this can be regarded as fully adequate today, though it 
was, of course, more defensible a century ago. While Goldschmidt was hardly 
the first to recognize the revolutionary consequences of industrialization, he 
may have been the first to identify industrial society as a distinctive taxon 
within the framework of a truly comprehensive taxonomy of societies. 

In the decade following the publication of Man's Way, I developed a mod- 
ified version of Goldschmidt's taxonomy. In Power and Privilege: A Theory 
of Social Stratification (Lenski 1966) and, subsequently, in Human Societies: 
An Introduction to Macrosociology (1970), I adopted Goldschmidt's basic 
taxonomy while introducing several additional features. Specifically, I added 
two new taxons, (i) fishing societies and (ii) maritime societies, and subdivided
Goldschmidt’s hunting and gathering, horticultural, agricultural, and industrial 
taxons into less advanced and more advanced categories. In addition, I (1966, 
p. 93) introduced the concept of hybrid taxons to take account of societies that 
combine substantial elements of two or more major subsistence technologies, 
and in a subsequent paper, Nolan and I (1984) have shown that two of these 
hybrid taxons—industrializing horticultural and industrializing agrarian soci- 
eties—are useful tools for differentiating among contemporary Third World 
societies. These taxons help to explain a number of striking differences among 
them, including variations in their rates of economic growth. 

In explaining the causes of variations in societal type, I have argued that 
variations in subsistence technology are not the whole story. Subsistence 
technologies are products of the interaction of societies and their cultures with 
different kinds of environments, not all of which have the same potentials 
(Heise et al 1976, pp. 319-320; Lenski & Lenski 1987, pp. 81-83). In addition 
to noting the obvious linkages between societal type and environment evident 
in herding, fishing, and maritime societies, I incorporated the recent discovery 
that some ecosystems that are suitable for horticulture do not permit the 
indigenous development of plow agriculture (e.g. many in the tropics). Differ- 
ences in environment, therefore, interacting with differences in subsistence 
technology, have had a tremendous influence on the course of societal devel- 
opment throughout history, and even today they are responsible for major 
differences in the kinds of social and economic formations found in various 
regions (Lenski & Nolan 1984). Differences in the timing of advances in 
subsistence technology are also relevant: thus, the more recent introduction of 
agriculture in the New World, compared to the Old, makes the industrializing 
agrarian societies of Latin America today different in a number of theoretically 
important ways from the industrializing agrarian societies of Asia (Nolan & 
Lenski 1985). Finally, my taxonomy makes explicit the multilinear nature of 
societal evolution—an idea that has long been recognized by many, but seldom 
incorporated into societal taxonomies (Lenski, Lenski, & Nolan, 1991, Figure
4.3, p. 73). 

For much of the twentieth century, anthropologists in the United States and 
Britain were obsessed with kinship systems and differences among them. This 
was partly due to the abandonment of the evolutionary perspective that had 
previously provided the discipline with its dominant paradigm. Searching for 
a new paradigm, the majority of anthropologists turned to structural-function- 
alism. Under its influence, many came to see kinship systems as the key 
institution in societies, and they developed their taxonomies accordingly. 

This anthropological concern with kinship was not simply a by-product of 
the abandonment of the evolutionary perspective, however. It was also a part 
of the basic heritage of the discipline, and a reflection of the enormous im- 
portance of kinship in preliterate societies, those that anthropologists know 
best. This led to a variety of taxonomies, most of which seem remarkably 
sterile today except, perhaps, to those whose interests do not extend beyond 
the limits of preliterate populations. For those whose interests include the entire 
spectrum of human societies, and especially for those who are interested in 
the process of societal evolution, the value of taxonomies that differentiate 
between the Eskimoan, Hawaiian, Yuman, Fox, Guinean, Dakotan, Sudanese,
Omahan, Nankanese, Iroquois, and Crow systems of kinship (Murdock 1949), 
and similar taxonomies, is minimal. 

Elman Service was one anthropologist who reacted against kinship-based 
taxonomies at a fairly early date. As a student of Leslie White, who, along 
with Childe, was one of the few proponents of evolutionism in the decades 
between the two world wars, Service recognized the serious limitations of 
taxonomies based on kinship. In Primitive Social Organization: An Evolution- 
ary Perspective (1962) and Origins of the State and Civilization (1975), he 
presented a taxonomy reminiscent of Spencer’s nineteenth-century taxonomy
of simple, compound, doubly compound, and trebly compound societies. Ser- 
vice’s taxonomy, like Spencer’s, is an organizational taxonomy based on 
variations in the territorial scope and intensity of the integrative organizational 
mechanisms in societies. 

Service identified four major societal types: (i) bands, (ii) tribes, (iii) chief- 
doms, and (iv) states. In his taxonomy, states are geographically more extensive 
and organizationally more integrated than chiefdoms, chiefdoms than tribes, 
and tribes than bands. As Service explained, in comparing tribes and bands: 
“A tribe is of the order of a large collection of bands, but it is not simply a 
collection of bands. The ties that bind a tribe are different from those of bands” 
(1962, p. 111). Service went on to state that whereas bands are held together 
by alliances based on reciprocity arising from the practice of exogamy and 
marital residence practices, tribes are unified by pan-tribal sodalities, such as 
clans, age-grade associations, secret societies, and sodalities for special
purposes, especially warfare, healing, and rituals. He added that there were
probably multiple routes or paths that lead from the band level of organization to
the tribal. 

Chiefdoms, according to Service, are more productive and more densely 
populated than tribes, but their critical distinguishing characteristic is that 
they "are redistributional societies with a central agency of coordination" 
(Service 1962, p. 144). Finally, the distinguishing feature of states is their 
reliance on bureaucratic governance supported by legal force (Service 1962,
p. 175).

Service, like many other taxonomists, recognized the existence of significant 
differences within his basic taxons. This led him to differentiate among three 
kinds of bands: patrilocal, composite, and anomalous. At the tribal level, he 
differentiated between lineal, cognatic, and composite tribes, and at the state 
level he differentiated between primitive states, such as those of the Ashanti 
and Aztec, and “classical” civilizations, such as those of the Chinese and 
Romans. 

In anthropology today, Service's taxonomy is probably used more widely 
than any other. It has also been adopted in recent years by numerous archae- 
ologists, especially in analyses of pre-Columbian New World materials. 

Not all twentieth-century taxonomies have been so successful. A prime
example of one that seemed promising when introduced, but is now largely 
ignored, is Pitirim Sorokin’s (1937) categorization of societal cultures as (i) 
ideational, (ii) sensate, (iii) idealistic, and (iv) mixed. Another largely unsuc- 
cessful taxonomy was that developed by Talcott Parsons (1966, 1971). In an 
effort to provide a more dynamic underpinning for his earlier, largely static, 
structural-functional work, late in life Parsons turned to evolutionary theory. 
In his analysis of the evolution of societies, he identified three major taxons: 
(a) primitive societies, (b) intermediate societies, and (c) modern societies. The 
second of these was subdivided into (i) archaic societies, (ii) advanced inter- 
mediate societies or historic empires, and (iii) seedbed societies. 

This taxonomy is strongly reminiscent of Saint-Simon's law of the three 
stages and of Comte's three stages. It also incorporates certain elements of the 
Weberian perspective. But despite its antecedents and Parsons' own enormous 
reputation, it has found little acceptance outside a narrow circle of his follow- 
ers. The explanation appears to lie in the substance of the taxonomy itself. The 
three basic taxons are remarkably crude and simplistic by comparison with 
other, more nuanced contemporary taxonomies (see Summary and Conclusions 
below). In effect, they are little more than “low,” “medium,” and “high”
developmental categories. Furthermore, the criteria used to differentiate the 
taxons are neither particularly insightful nor diagnostic. Finally, the categories 
of "seedbed" societies and "modern" societies have an unfortunate western or 
Eurocentric bias.


PARTIAL TAXONOMIES FOR SOCIETIES OF THE
MODERN ERA


Up to this point, most of the taxonomies that have been considered were 
designed to cover the entire spectrum of human societies, from the most 
primitive to the most advanced. Before concluding this survey, we need to 
consider a number of taxons and taxonomies that have been developed in 
recent decades with a much more limited objective, namely, to differentiate 
among contemporary societies. These taxonomies are largely—though not 
entirely—products of the post-World War II era. Many are the creations of 
journalists, politicians, or bureaucrats, and they have become elements of 
popular culture. Nevertheless, most of them have become valuable—even 
essential—tools for social scientists. 

The first of these partial taxonomies emerged during the late nineteen-twen- 
ties as observers of the international scene became increasingly conscious of 
the distinctive features of the new regimes established in the Soviet Union 
under Lenin and Stalin and in Italy under Mussolini. This led to the concept 
of totalitarianism and a distinction between totalitarian and nontotalitarian 
societies. According to the Oxford English Dictionary Supplement of 1933, 
the earliest uses of the new concept in English were in articles by British 
journalists in 1928 and 1929. After Hitler's rise to power, the concept was 
extended to German society, and later it was applied to other societies con- 
trolled by communist and fascist regimes. 

Although this new taxonomy was based on a political criterion, it was much 
broader in conception. Totalitarian societies were ones in which a dominant 
political party imposed its own distinctive vision on most or all aspects of 
human life—from art and religion to politics and the economy. Even the 
sciences were not exempt. 

During the 1930s, 1940s, and early 1950s, the concept of totalitarianism 
was much in vogue in scholarly writing as well as in the mass media. Subse- 
quently, however, it gradually lost favor and now is seen only infrequently. 
The reason for this appears to be the changes that have occurred in societies 
rather than any intrinsic defect in the concept. Thus, the fascist regimes in Italy 
and Germany were replaced by democratic governments following World War 
II, and most communist regimes have either collapsed or been obliged to relax 
their more extreme controls. 

Responding to such changes, social scientists have come to make increasing 
use of the concept of "authoritarian societies." These resemble totalitarian 
societies in that control of government is in the hands of a small, closed, 
self-appointed, and often self-perpetuating political elite. In authoritarian so- 
cieties, however, these elites allow the rest of the population substantial latitude 
in art, religion, economics, and other areas of life. In current usage, the concept 
of authoritarian societies is applied not only to a number of surviving
communist regimes, but also to societies controlled by undemocratic, right-wing elites.

In the years following World War II, as the United States became increas- 
ingly involved in world affairs, one of the important developments in macro- 
sociology and other social sciences was the formulation of what came to be 
known as modernization theory. This gave rise to a popular distinction between 
traditional and modern societies. Drawing on Weberian ideas, modernization 
theorists argued that in traditional societies social and economic development 
was constrained by nonrational elements embedded in traditional ideologies 
(e.g. fatalism, belief in magic) which could only be overcome through the 
expansion of educational systems and the mass media. 

In 1960, Clark Kerr, together with several fellow labor economists, pub- 
lished Industrialism and Industrial Man, in which they emphasized the role 
of industrialization as the transforming force in the modern world. They argued 
that all societies, regardless of ideology, when exposed to the forces of indus- 
trialism respond similarly in a number of important ways, and that this lays 
the foundation for a process of social and cultural convergence. 

As labor economists, Kerr and his colleagues were especially concerned 
with the implications of industrialization for the labor force and the work place. 
Challenging Marx and Engels' vision of the working class under advanced 
industrial capitalism, Kerr et al (1964, p. 17) argued that industrialization, 
whether under capitalism or socialism, requires a skilled work force. Instead 
of homogenizing the working class, industrialization diversifies it; instead of 
degrading it, industrialization elevates the majority of its members by creating 
both the need and opportunity for increased education, and by improving 
working and living conditions. Kerr and his associates also argued that Marx 
and Engels were wrong in supposing that the state would inevitably be an 
instrument of oppression in capitalist societies or would wither away under 
socialism. Instead, in all industrial societies the role of government expands, 
and this expansion benefits the great majority of individuals, even if unequally. 
Thus, while ideologies have created differences among industrial societies, the 
compelling logic of industrialism tends, over time, to erode them. 

Not long after Kerr and his colleagues published Industrialism and Industrial 
Man, Radovan Richta and colleagues at the Czechoslovak Academy of Sci- 
ences published Civilizace na Rozcesti (1969), and Daniel Bell published The 
Coming of Post-Industrial Society (1973). The central thesis in both volumes 
was that advances in science and technology in the twentieth century have 
been transforming the nature of industrial societies in ways that make older 
conceptualizations, older views, and older policies inappropriate. 

Bell boldly proclaimed the coming of a new social order—post-industrial 
society—that would be as different from industrial society as the latter had 
been from the preindustrial society that preceded it. In the new societies, both 
west and east, education would be increasingly important, and social scientists
now often refer to it as a form of investment that generates “human capital.” 
For individuals and for societies, power and influence would be based increas- 
ingly on the control of this new form of capital, and systems of inequality 
would increasingly reflect it. 

The chief beneficiaries of these changes would be the rapidly expanding 
professional classes. Also, in the new post-industrial societies, machines would 
quickly eliminate the need for unskilled human labor, and the nature of the 
working class would be drastically transformed. In fact, it was argued, with 
each passing year the class structure of industrial societies bears less and less 
resemblance to that of the nineteenth-century societies on which Marx based 
so much of his analysis. Finally, Bell argued that values are changing and a 
new consumer society is emerging. 

Clearly, Richta, Bell, and others who followed their leads have captured an 
important aspect of change in contemporary society. Whether this change is 
comparable to the change from preindustrial to industrial society is a matter 
on which there will necessarily be differences of opinion. Such differences 
will depend in large measure on the time perspective employed: for those 
whose horizons are limited to the decades since World War II, the importance 
of this change, and the need for differentiating between industrial and post-in- 
dustrial societies, will loom far larger than for those whose perspective spans 
centuries or millennia. 

Even for those with the more limited perspective, however, there is reason 
to question Bell's identification of contemporary societies as “post-industrial.” 
Although a declining percentage of the labor force is employed in the manu- 
facture of goods (i.e. in industrial production), these societies remain industrial 
in the more fundamental sense that their entire way of life presupposes an 
industrial technology and an industrial mode of production. In fact, these 
societies are even more dependent on inanimate sources of energy and machine 
production than earlier industrial societies. Thus, some other label, such as 
"advanced industrial societies," seems more appropriate. 

During the 1960s, the optimism that had fueled the rise of modernization 
theory and its various offshoots began to wane, especially in parts of the Third 
World. One consequence of this was the emergence of dependency theory in 
Latin America in the 1960s and its distinction between the dominant core 
societies of the industrialized West and the so-called periphery of dependent 
societies. 

Building on the foundation laid by dependency theorists, Immanuel Waller- 
stein (1974) formulated world-system theory with its three-tiered taxonomy 
consisting of core states, semiperipheral areas, and peripheral areas. Classifi- 
cation of individual societies in Wallerstein's taxonomy, as in the taxonomy 
of dependency theorists, is based on the role and status of societies in the 
capitalist world-system. Wallerstein also developed a second taxonomy.
World-systems, he argued, take two forms: either they are world-empires or 
they are world-economies. In the former, social units are united politically (as 
in the Roman Empire), while in the latter they are integrated economically 
(1974, p. 15 ff).

Although Wallerstein's theory and taxonomy enjoy considerable popularity, 
especially among American neo-Marxist sociologists, they have not escaped 
criticism. Two of the more serious appear to be (i) Wallerstein's rather pan- 
glossian view of socialism and socialist economies, and (ii) his tendency to 
minimize the role of indigenous intrasocietal forces in the developmental 
process in peripheral and semiperipheral societies (Chirot & Hall 1982). Even 
Latin American dependency theorists now give greater attention to indigenous 
forces. 

With the creation of the United Nations following World War II and with 
growing interest in the global community of nations, social scientists, states- 
men, journalists, and bureaucrats came increasingly to make comparisons 
between societies that were variously described as modern or developed and 
others that were referred to as backward or undeveloped. Gradually, however, 
the latter terms were replaced by euphemistic alternatives. Thus, undeveloped 
or backward societies became underdeveloped societies, and, later, less devel- 
oped societies. As the record keeping function of the United Nations expanded, 
its bureaucracy found it necessary to combine its developmental taxonomy 
with a political and ideological taxonomy that differentiated between societies 
with market-based economies and those with centrally planned economies. 
This mixed taxonomy came to be used widely, not only in statistical reports 
of the UN and other international agencies, but also in much empirically 
oriented writing in the social sciences. 

Another taxonomy that was of enormous importance in recent decades 
was a direct by-product of the political struggle between the Soviet Union 
and the western democracies. In an attempt to counteract efforts by the major 
powers to draw other nations into their respective camps, Tito, Nasser, and 
Nehru organized a conference of leaders of nonaligned nations and an- 
nounced the formation of a new, third bloc. Journalists and others quickly 
began referring to this bloc as the Third World. The western democracies 
were then said to form the First World and the nations of eastern Europe 
the Second. 

Originally, the term, "Third World," referred to a politically defined set of 
societies—societies that were not aligned with either of the superpowers. 
Gradually, however, the concept changed meaning. A number of less devel- 
oped societies with close ties to one or the other of the major superpowers 
were gradually admitted to membership in the nonaligned bloc with the result 
that the nature of the organization—and, with it, the nature of the concept— 
gradually changed. As a result, the original meaning of “Third World” no
longer applies. Today, the term is synonymous with “less developed.” 

For various reasons, the concept of the Second World and its role in the 
global system was never clearly established. Some who used this popular 
taxonomy appeared to differentiate between First and Second World societies 
almost entirely on the basis of political, ideological, and economic criteria (i.e. 
market economies vs centrally planned economies), while others also tended 
to treat Second World societies as intermediate between First and Third World 
societies in terms of economic development. Still others simply ignored the 
Soviet bloc of societies. Now, however, with the collapse of communism in
eastern Europe, the issue appears moot. 

Problems with this taxonomy were often aggravated by the fact that many 
who used it had a partially hidden political or ideological agenda. The concepts 
were often used more as weapons in debate or propaganda than as tools of 
analysis. Thus, although the concepts reflected an important aspect of reality, 
they were not without serious problems. 

Before concluding this survey of taxonomies of societies of the twentieth 
century, note must be taken of the struggles of Marxists to cope with the 
realities of contemporary social and economic formations. This has been a 
difficult and multifaceted problem for Marxists. Neither western capitalist
societies nor the socialist societies that developed in eastern Europe and else- 
where conformed at all well to the models formulated by Marx in the nineteenth 
century. On the one hand, western capitalism has not given rise to the pro- 
gressive immiseration of industrial workers that Marx predicted, nor has it 
driven them to revolutionary consciousness. On the contrary, workers in these 
societies now enjoy a vastly higher standard of living than their predecessors 
of a century ago, and revolutionary consciousness is far more prevalent among 
intellectuals, actors, artists, journalists, and students than among industrial 
workers. In addition, democratic processes have at least partially transformed 
the state into an agency which, through welfare programs, serves as a protective 
mechanism shielding all classes, in varying degrees, against many of the 
harsher features of a market economy. 

To compound problems, Marxist societies have proven uniformly disap- 
pointing. Their dependence on harsh repressive measures was long a source 
of embarrassment to at least some of their more idealistic western supporters. 
Moreover, the state never withered away, but expanded enormously, economic 
inequality persisted and political inequality became extreme, efforts to shift 
from material to moral incentives failed badly, and, worst of all, the new 
socially conscious socialist man never emerged. In the end, most of these 
societies have been obliged to adopt many of the long-criticized arrangements 
and practices of capitalism. 

For all these reasons, Marxist theorists have been compelled to modify the 
taxonomy inherited from Marx, if anything was to be salvaged from it. One
“solution,” developed in the Soviet Union in the 1970s and 1980s, was to 
expand the original taxonomy by introducing the concept of "developed so- 
cialism" as an intermediate stage between socialism and advanced communism 
(Evans 1977). This was devised in an effort to explain the persistence of the 
state, the presence of substantial social inequality, and other theoretically 
inappropriate features of Soviet society. 

In a far more critical vein, some Marxists and Marxist sympathizers outside 
of the Soviet Union created a variety of other, less flattering concepts in an 
effort to distance themselves from the increasingly obvious failures of the 
Soviet Union and its satellites. These newer concepts included "state capital- 
ism" (Cliff 1974), "state socialism" (Lane 1976), "bureaucratic collectivism" 
(Schactman 1962), and "deformed workers states" (Trotsky 1937). Similar 
revisions have been introduced in an effort to salvage Marx's analysis of 
capitalist societies. These efforts underscore the seriousness of the problems 
inherent in both Marx's original theory and the taxonomy based on it. 


SUMMARY AND CONCLUSIONS 


Unlike the physical sciences, the social sciences have yet to agree on a taxon- 
omy for the most basic entities in their domain of study, human societies. They 
are not even as close to agreement as the biological sciences, where debates 
concerning many specific aspects of the taxonomy of species continue, but 
within a framework whose underlying principles and basic parameters are 
generally agreed upon. 

Given the large number of societal taxonomies that have been proposed over 
the years, and the continuing lack of consensus concerning their relative merits, 
the question arises: How does one choose among them? What principles should 
guide selection among them? And, finally, is there any compelling rationale 
for preferring one rather than the others? 

In choosing among alternatives, several criteria seem fairly obvious. First, 
if our concern is with a taxonomy that is suitable for a science of human 
societies, rather than with a taxonomy appropriate for some limited problem 
of research, we need a comprehensive taxonomy. Second, a science of human 
societies also requires an unambiguous taxonomy—one whose criteria are 
specified so clearly that they can be applied by different scholars with a high 
degree of reliability. Third, a carefully nuanced taxonomy is always preferable 
to a more simplistic one. 

Finally, there is a fourth criterion that is also important, but it is less obvious 
than the others. Here, the experience of the biological sciences is instructive. 
Early taxonomies of plant and animal species were based on what are now 
recognized as phenotypic characteristics of populations. With the rise of the 
new science of genetics, however, the basis of the taxonomy shifted to 
genotypic characteristics to take account of the newly acquired understanding of 
causal processes. In effect, by shifting the basis of classification from pheno- 
typic to genotypic characteristics, biologists transformed their taxonomy from 
one based on dependent variables to one based on the most powerful of the 
independent variables. 

There is good reason to believe that the adoption of this principle (i.e. 
classifying societies on the basis of the most powerful independent variable) 
would also prove of benefit to sociology. Unfortunately, however, there is still 
no consensus as to which, if any, set of societal determinants is most basic. 

If we look closely at this matter, however, we can see that differences in 
viewpoint generally reflect differences in the data sets with which their advo- 
cates work and about which they are most knowledgeable. Thus, those whose 
interests are limited to some single contemporary industrial society (i.e., the 
great majority of sociologists today), and even those whose interests extend 
to other industrial societies of the present day, tend to believe that differences 
in ideology and organization (especially political and economic organization) 
are the most important. This is hardly surprising, however, since the societies 
on which they base their view all share what is essentially a common tech- 
nology. 

In contrast, those whose interests extend more broadly tend to see matters 
differently. Those whose interests encompass the full range of human societies, 
from prehistoric times to the present, are much more likely to be impressed 
by the critical transforming role of technological advance on human life and 
human societies. In short, the more inclusive or comprehensive the data set 
with which social scientists work, the greater the importance they tend to attach 
to technology and technological change relative to other variables. This sug- 
gests that while various taxonomies may be appropriate to analyses of problems 
involving limited sets of societies with similar technologies, the taxonomy 
most appropriate for a general theory of human societies and for a science of 
human societies is one based on technology, and more especially subsistence 
technology. 

The reasons for this, and the reason why technological advance, rather than 
ideological and organizational innovation, has generated the most revolution- 
ary social and cultural changes throughout history, are not hard to find. Sub- 
sistence needs are the most urgent and compelling of all human needs, and the 
technologies created in response to these needs therefore become an essential 
part of the basic framework of every society. It is within this framework that 
solutions to other problems and other needs then must be devised. Moreover, 
new, improved technologies cannot be conjured up on demand (witness the 
prolonged and still unsatisfactory attempts to find safer and less polluting 
energy sources and safer and better means of disposing of toxic and radioactive 
wastes, to cite but two examples), and a return to older, previously abandoned 
technologies is usually impossible because of unacceptable costs in human life 
and welfare. Thus, societies become "locked into" existing technologies for 
extended periods until more satisfactory technologies can be devised. 

Constraints on organizational and ideological innovation are usually much 
less severe than those on technological innovation. Organizational arrange- 
ments and ideologies are far more easily adapted to prevailing technological 
constraints than subsistence technology can be adapted to organizational and 
ideological constraints. 

It seems no exaggeration to say that advances in subsistence technology are 
functionally equivalent to adaptive changes in a population's gene pool: new 
energy resources and new materials enable populations to do things that they 
could not do before. This means, in effect, that while changes in organization 
and in ideology are comparable to the rearrangement of the furniture in a house, 
advances in subsistence technology are like the addition of a new wing or even 
the replacement of a single-family residence by a multifamily apartment 
house or hotel. Putting matters more precisely, changes in organization and 
ideology can lead to new and innovative uses of existing resources, but ad- 
vances in subsistence technology make totally new resources available (e.g. 
new sources of energy, such as petroleum or nuclear energy; new kinds of 
materials, such as metals or plastics; or new kinds of machines, such as steam 
engines or computers), thereby overcoming historic constraints on human 
action and paving the way for radically new organizational possibilities and 
new ideological perspectives. 

Evidence of this is clear in the archaeological and ethnographic records. 
The shift from hunting and gathering to horticulture, for example, was a 
necessary precondition for the shift from nomadic to more permanent settle- 
ments in most parts of the world. Similarly, it was an essential precondition 
for the more than 1000-fold growth in human population in the last 10,000 
years (Hassan 1981, Table 12/3). These developments, together with subse- 
quent dependent advances in the technologies of transportation and commu- 
nication, were necessary preconditions for all of the new and more complex 
forms of social and economic organization and all of the new ideologies of 
the last five thousand years. Without the initial impetus provided by the shift 
from foraging to farming, and without subsequent technological advances that 
vastly expanded the economic surplus, human societies would still be much 
as they were during the late Stone Age, regardless of any ideological or 
organizational changes that might have been adopted. 
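
The scale of the population growth cited above can be made concrete with a short calculation. The sketch below is illustrative only: it takes the round figures from the text (a 1000-fold increase over 10,000 years) as lower-bound inputs and assumes, purely for simplicity, a constant annual growth rate, which the historical record does not support in detail. The function names are hypothetical.

```python
import math


def implied_annual_growth(fold_increase: float, years: float) -> float:
    """Constant annual rate r such that (1 + r) ** years == fold_increase.

    Assumption: growth is treated as perfectly steady, a simplification
    adopted here only to convey the order of magnitude.
    """
    return fold_increase ** (1.0 / years) - 1.0


def doubling_time(rate: float) -> float:
    """Years needed for a population to double at a constant annual rate."""
    return math.log(2.0) / math.log(1.0 + rate)


# Round figures taken from the text: 1000-fold growth in 10,000 years.
rate = implied_annual_growth(1000.0, 10_000.0)
years_to_double = doubling_time(rate)

print(f"Implied average growth: {rate:.4%} per year")
print(f"Implied doubling time: {years_to_double:.0f} years")
```

Under these assumptions the implied average rate is roughly 0.07 percent per year, with a doubling time of about 1,000 years. Growth this slow when averaged over ten millennia is still enough to yield a thousandfold increase, which is consistent with the argument that such growth depended on a sustained expansion of subsistence capacity.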

It would be a mistake to suppose, however, that the only justification for 
asserting the primacy of subsistence technology in the causal process at the 
societal level lies in the distant past when societies shifted from hunting and 
gathering to farming. Subsequent advances in subsistence technology continued 
to remove constraints on human action and to expand the range of possibilities, 
thus paving the way for what were previously impossible developments 
in social organization and ideology. In fact, the continuing (and accelerating) 
industrial revolution of the last two hundred years has been responsible for a 
vastly more rapid and no less profound transformation of human life than the 
earlier horticultural revolution. 

It is true, of course, that a process of feedback has always been operative. 
Some innovations in social and economic organization and some innovations 
in ideology have been essential to the continuation of advances in technology, 
and other innovations, while not essential, have made important contributions. 
But as Childe (1936, ch. 9) made clear in his provocative analysis of the ancient 
world, organizational and ideological innovations do not always have this 
effect: Many innovations in social organization and ideology stifle, rather than 
stimulate, technological advance and societal development, as the recent ex- 
perience of the Soviet Union and other Marxist societies reminds us. 

In summary, then, it appears that for a science of human societies, a mate- 
rialist taxonomy of the kind that has evolved out of the pioneering work of 
eighteenth-century writers, such as Turgot, Millar, and Ferguson, offers the 
greatest promise. A taxonomy based on subsistence technology combines 
comprehensiveness (a feature lacking in most competing taxonomies) with 
minimal ambiguity and maximal reliability. Equally important, it is based on 
a variable that explains more of the variance among human societies as a whole 
than any other (see Heise et al 1976, especially pp. 334-36). In addition, it 
lends itself readily to a process of modification and refinement as the need for 
that is indicated by new evidence or new developments, or the formulation of 
more highly nuanced categories. Thus, it has already been shown that the 
economically less advanced societies in the world today (i.e. those that are 
usually lumped together under the single rubric of “developing nations” or 
“the Third World”) can be fruitfully disaggregated on the basis of their tech- 
noeconomic heritages, and the timing of their transition from horticulture to 
agriculture, into three subsets with significantly different characteristics and 
different patterns of economic and social development (Lenski & Nolan 1984, 
Nolan & Lenski 1985). 

In the years ahead, if sociologists begin to take the need for a societal 
taxonomy more seriously, our discipline could benefit in many of the same 
ways as other disciplines that have recognized the importance of a comprehens- 
ive and systematic mapping of the phenomena in their fields. The adoption of 
a truly comprehensive taxonomy of societies would almost certainly contribute 
to more cumulative development in theory and spur many fruitful new lines 
of research (i.e. research that would be more comparative and historical in 
nature). 

Such a development would greatly help to clarify the currently all too fuzzy 
distinction between serious sociology on the one hand and pop sociology, 
journalism, and common sense on the other. By analyzing social phenomena in 
a theoretically rigorous and systematic comparative and historical framework, 
sociologists would move beyond the level of common sense and beyond the 
experience of intelligent laymen. In short, we would bring something new and 
distinctive to bear, something that the mass media and personal experience are 
unable to provide. A discipline that was able to put social phenomena (societal 
development, race and ethnic relations, crime, and all the rest) in a truly 
comprehensive comparative and historical framework would almost certainly 
command respect, since it would, of necessity, provide many new and valuable 
insights. It would also have the virtue of necessarily addressing change in a world 
in which rapid change has become the most important fact of life and the source 
of many of our major ills; and it would put the analysis of change in a global 
perspective in a world in which this is increasingly imperative. 

But these benefits will not come without a price. Many of us will have to 
retool in much the same way that we have had to retool in recent decades to 
take advantage of the new opportunities provided by analytical statistics. We 
will have to expand our horizons and become more knowledgeable about 
societies that are spatially and temporally remote from our own. This will not 
be easy, but it could make the difference between a discipline that fulfills the 
promise of its founders and one that is all but indistinguishable from pop 
sociology and common sense. 


Abstract 


The question of how to incorporate women in class analysis and stratification 
research has been the topic of heated controversy in recent decades. Much of 
the debate has been about the conventional approach to research on social 
mobility and class analysis that assumes the family to be the unit of stratifi- 
cation and the family’s class position to be determined independently of 
women’s work position. Those defending the conventional view can show that 
research on the empirical validity of the conventional view provides partial 
support for it, and that its use in previous empirical research probably has not 
resulted in serious misrepresentations. In this article, I review the literature on 
these issues. I summarize the criticism and defense of the conventional view 
and review research that examines its empirical adequacy. This is followed by 
a discussion of alternative approaches to the determination of the family’s class 
position. 


INTRODUCTION 


All classical theories of social stratification and social class share the funda- 
mental assumption that the family is the unit of stratification. This means that 
members of the same family are assumed to occupy a single position in the 
stratification hierarchy. Men, women, and children living together in a family 
all occupy the same social class; they are treated as social equals and are 
assumed to have similar interests, to share similar life chances, and to have 
the same standard of living. In particular, because members of the same family 
are assumed to have similar interests, the family also is regarded as the basic 
unit in processes of class formation and in class action.1 

This so-called conventional view further assumes that the unitary nature of 
the family in terms of class position is unaffected by the nature of men's and 
women's economic roles. In particular, women's class position is independent 
of their own employment status, and the social position of the family is 
completely unaffected by women's work position. The family's position is the 
same whether the wife is a housewife, a secretary, a professor, or an unskilled 
worker. The housewife is seen as having the same life chances and interests 
as her employed spouse; women for whom an occupational career is important 
are assumed to have interests or life chances similar to those of their spouse, 
and children's life chances are assumed to be unaffected by their mother's 
employment situation or level of education. 

Different reasons have been given for seeing the family as the unit of 
stratification. Two reasons are especially important. One is the functionalist 
view that equivalent ranking of family members is needed for the family to 
perform one of its major functions, namely, ascription of status to children 
(Davis 1948:364). The other view is that the family must be the unit of 
stratification because only "certain members of the family have a directly 
determined position within the class structure" (Goldthorpe 1983:468). Be- 
cause of gender inequality, this member is the male in most households. 

The use of the family as the unit of stratification creates a theoretical 
problem: Is this a universal, or is the unit a matter of specific social conditions? 
In the latter case, the problem lies in specifying the conditions under which 
the assumption should be abandoned. Furthermore, if the family is maintained 
as the unit, then there is a measurement problem: how should the family's 
position in the stratification system be determined? The theoretical problem 
has not received much explicit attention, but the functionalist assumption that 
the unitary valuation of the family is a functional necessity (Parsons 1953:116-117) 
would seem to imply that the conventional assumptions never should be 
abandoned. However, the notion that gender inequality within and outside the 
family provides support for using the family as the unit (Goldthorpe 1983) 
suggests that gender inequality constitutes a condition for the conventional 
view, so that a disappearance of gender inequality would require that the 
assumption about the family as the unit be altered. 

The classical approach to the study of social stratification solved the mea- 
surement problem by measuring the family's class position by the work posi- 
tion of the male head of the household. This solution implied that women's 
class position by definition was derived from their husband's or father's class 
position and that data on women's activities and work lives were irrelevant: 
Women were invisible in both theory and research on stratification and social 
class. Note that this solution does not necessarily follow from the assumption 
of the family being the unit: It is possible to assume that the family is the unit 
without measuring the status of the family by the work position of the male 
head. 


1None of the issues discussed here depends on the specific conception of class. I shall in the 
following use the term "class" quite loosely to refer both to Weberian and Marxist class conceptions. 

Changes in gender equality have caused many to reject the conventional 
view. Already in the 1960s, the increase in married women's gainful employ- 
ment had created a need for a re-evaluation of the procedure by which a 
family's class position is determined and an empirical assessment of the extent 
to which family members occupy the same position in the stratification 
hierarchy (Barth & Watson 1967, Watson & Barth 1964).2 The challenge would 
be to determine how and to what extent women's employment actually affected 
the family's position in the stratification system. This suggestion of 
maintaining the assumption of the family as the unit, but changing the measurement 
of the status of the family, was not taken seriously by students of stratification 
and social class until many years later. 

The reaction to the conventional approach that did have a strong impact was 
the feminist critique, which took as its point of departure the exclusion of 
women from stratification research and the lack of attention paid to women's 
work and to gender inequality in the labor market. The feminist solution was 
a complete abandonment of the assumption of the family as the unit of 
stratification and of class analysis (Acker 1973, Delphy 1984). The driving force 
behind this was the wish to include and make women visible in stratification 
research. According to this school of thought, the class position of individual 
men and women measured by their own work position should be the focus of 
research in social stratification. This recommendation spoke strongly to the 
sociological imagination in the 1970s, and innumerable studies of women's 
and men's occupational careers resulted. The individual became the preferred 
unit of analysis, and this change brought about the desired result: Women and 
women's employment became visible; gender inequality in labor market out- 
comes became an important area of sociological research; and our understand- 
ing of the mechanisms behind women's continuing disadvantage in the labor 
market vastly improved (e.g. Baron & Bielby 1988, DiPrete & Grusky 1990, 
England & McCreary 1987, Reskin & Hartman 1986, Reskin & Roos 1990, 
Rosenfeld 1980, Roos 1985, Sewell, Hauser & Wolf 1980). 


2Svalastoga's study of occupational prestige in Denmark provides an early example of a 
departure from the conventional procedure for determining the family's position. He assigned husband 
and wife the same occupational prestige except when she had an occupation ranking higher than 
that of her husband, in which case she was assigned a prestige ranking based on her own occupation 
(Svalastoga 1959:140). 
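
The assignment rule attributed to Svalastoga in the note above can be written out as a small function. This is a minimal sketch of one reading of that rule, not a reconstruction of his actual coding procedure: it assumes that prestige is a numeric score on which higher values mean higher prestige, that the shared score is the husband's, and that a wife without an occupation is represented by None. The function name and the scores are hypothetical.

```python
from typing import Optional, Tuple


def svalastoga_prestige(
    husband_score: float, wife_score: Optional[float]
) -> Tuple[float, float]:
    """Return (husband's prestige, wife's prestige) for a married couple.

    Both spouses receive the husband's score, except when the wife's own
    occupation ranks higher, in which case she keeps her own score.
    Assumption: higher numbers mean higher prestige; None means the wife
    has no occupation of her own.
    """
    if wife_score is not None and wife_score > husband_score:
        return husband_score, wife_score
    return husband_score, husband_score


# Hypothetical scores, for illustration only.
same_as_husband = svalastoga_prestige(50.0, 30.0)   # wife ranks lower
wife_keeps_own = svalastoga_prestige(50.0, 70.0)    # wife ranks higher
housewife = svalastoga_prestige(50.0, None)         # wife not employed
```

Written this way, the rule is visibly asymmetric: the wife's occupation can raise her own score but never lowers it, and it never alters the husband's.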
Stratification theory and research was under fire for several years for its 
assumption that the family was the unit of class analysis. In 1983, John 
Goldthorpe (1983) responded with a vociferous defense of the conventional 
view. He rejected the charge that class analysis had ignored the existence of 
gender inequality (Acker 1973) and renewed the argument for taking the family 
as the unit of class analysis. Goldthorpe did not disregard inequality between 
the sexes in his analysis. On the contrary, women's disadvantaged position in 
the labor market and their continuing dependence on men were the main 
reasons for maintaining the family as the unit of stratification. Since members 
of a family living together typically share similar life chances and material 
conditions, women's life chances, standard of living, and "class fate" will be 
influenced much more by their husband's occupational position than by their 
own (or by the fact that they are women). The main reason for the dependence 
of women on men is their weaker attachment to the labor force, and the fact 
that their "employment typically forms part of a family strategy" (Goldthorpe 
1983:469). 

The question as to whether changes in women's employment should have 
any consequences for the determination of a family's class position was also 
addressed in Goldthorpe's defense. Goldthorpe made one admission at the 
outset, namely, that the occupational position of the head of the household, 
whether it be a man or a woman, should be seen as the best indicator of the 
family's social position. However, a mere increase in women's labor force 
participation is not sufficient to include women's occupational position in 
the determination of a family's class position. Goldthorpe reasoned that 
women typically remain dependent on men even when employed, as shown 
in studies of women's economic dependency (Sørensen & McLanahan 1987). 
The only "truly problematic" situation envisioned by Goldthorpe was one 
where "the extent and nature of female participation in the labour market is 
such that in the more 'normal' conjugal family it is increasingly hard to say 
whether the husband or wife should better be regarded as the family 'head' 
and that in many cases there are in effect two 'heads' with, quite often, 
different class positions" (Goldthorpe 1983:470). It would seem to follow 
that equality between the sexes in the labor market should constitute the 
conditions under which the conventional solution to the measurement of the 
family's social class should be abandoned or at least reexamined. This 
was, of course, the suggestion made by Barth & Watson already in the 
1960s. 

Goldthorpe's defense of the conventional view had a galvanizing effect on 
the research community. It provoked a lively theoretical debate (see papers in 
Crompton & Mann 1986),3 and it generated a great deal of empirical research 
which began to address the question of how women's new economic roles 
should affect the study of class and in particular the study of the family's class 
position in modern society. This line of research is the focus of this review. 
Yet, before turning to the issues addressed in these studies, we need to clarify 
the issue of unit of analysis in stratification theory and research. 


INDIVIDUAL OR FAMILY? 


The increase in women's employment has made it common that two important 
dimensions of social class do not coincide. They are (i) the individual's 
relationship to the labor market as expressed in his or her work situation, and (ii) 
the family's relationship to the consumption market, or its market situation 
(Dale et al 1985, Erikson 1984). In other words, when a high proportion of 
women are employed throughout the life course, it more often becomes 
necessary to distinguish analytically between the family's class position 
and the class position of each of its members in the labor market. If families 
have only one person in gainful employment, this person's work situation 
probably is a reasonably good indicator of the family's lifestyle and standard 
of living and of the family members' interests and life chances.4 It is much 
less evident that this is the case when both partners in a family occupy positions 
in the labor market. Women's new economic roles have thus brought an end 
to the situation where the class structure could be analyzed by studying men's 
positions, and where the study of the social class men occupy via their position 
in the labor market was equivalent to studying the class position of families. 

This means that research on the class position of individual men and women 
asks questions that differ from those asked in the research on the class position 
of families. In other words, the choice of unit of analysis in stratification 
research and class analysis reflects the substantive concern of one's research. 


3For summaries of this debate, see Abbott & Sapsford 1987, Crompton 1989, Lewis 1985, Wright 
1989. 

4This naturally assumes that social class can be adequately indexed by position in the labor 
market. This is assumed in most studies of stratification and class in industrial and post-industrial 
societies. 

5This research practice is the source of much misunderstanding in the debate over the proper unit 
of analysis in stratification research. Much of this debate (see e.g. Acker 1973, Delphy 1984, 
Crompton & Mann 1986, Crompton 1989, Goldthorpe 1983, Stanworth 1983) stemmed from the 
fact that researchers, such as Blau & Duncan (1967), Goldthorpe (1980), and Erikson & Goldthorpe 
(1992) used men’s occupational experiences as indicators of their family’s social class and men's 
intergenerational occupational mobility as an indicator of the mobility regime in society, and that 
others, notably researchers interested in women’s occupational careers and gender inequality in the 
labor market, saw these as studies of individual men’s social class and occupational mobility. 
It is useful to distinguish between three types of concerns. The first has to do 
with research on the class position of individuals, the second with the family’s 
class position, and the third with the influence of the family on the individual's 
class position. 

When the concern is with mapping the class structure, understood as 
class positions in the system of production (e.g. Wright 1985), or with gender 
inequality in labor markets (Roos 1985, Rosenfeld 1980, Wolf & Fligstein 
1979), then the appropriate unit of analysis is the individual. Similarly, research 
on men's and women's occupational mobility chances is concerned with 
changes in the class positions that individual women and men occupy via their 
position in the labor market. This can be research on intergenerational mobility 
from father (or mother) to son and daughter (Abbott & Sapsford 1987, Dex 
1990, Marshall et al 1988, Miller & Hayes 1990, Rosenfeld 1978) or over the 
work life (Carroll & Mayer 1986, Rosenfeld 1980). In this research, women's 
class position is determined in the same way as it is for men, that is, it is 
typically based on a coding of occupations and occupational characteristics 
into a class scheme, a status scale or a prestige scale. The inclusion of women 
in this research is straightforward and is a clear outcome of women's new 
economic roles. 

Research on those aspects of the stratification or class system that are linked 
to the pooling of resources and the sharing of living conditions clearly must 
consider the interdependence among the members of the group that shares 
resources and living conditions. This group is usually the family. In research 
on the family's material conditions of life, its lifestyle, its interests, and the 
life chances it offers, especially to the children, the family should be taken as 
the unit of analysis, and its members should be assumed to share the same 
class position. The measurement of the family's class position is a major task 
here. This is the measurement problem that has been made considerably more 
difficult and controversial by women's new economic roles, as the previous 
discussion has shown. 


6The inclusion of women in this research has led to concerns over whether measures of class, 
socioeconomic status, or prestige that were developed largely with reference to men's occupations 
need to be adjusted to properly reflect women’s position. There are two different issues in this regard. 
One is that some occupations, occupied largely by women, such as many of the new jobs in the 
expanding service sector, may be difficult to fit into an existing class scheme. For example, in the 
Erikson-Goldthorpe class scheme, low-skill, nonmanual, lower grade positions are grouped together 
with other nonmanual employees when only men are studied, but when women are included, the 
lower grade nonmanual positions are made into a separate class (Erikson & Goldthorpe 1992:44). 
This is to ensure that each class remains relatively homogeneous with respect to work and market 
situation. The second issue is that men and women occupying the same position, and thus coded as 
being in the same class or having the same status, may in reality not reap the same benefits from the 
position, whether in the form of economic security, level of income, authority, status, or prestige. 
If that is the case, then the degree of within-class heterogeneity may be higher when women are also 
included. 

The pooling and sharing of resources in families is likely to have conse- 
quences for an individual's resources and labor market participation, that is, 
their individual class position (Curtis 1986). This is a basic idea in the research 
on the interplay between the family of origin and individual career trajectories 
in status attainment. The study of the influence of women's family situation 
on their employment status and occupational careers has been especially im- 
portant in the study of women's careers, but the family clearly may also 
influence men's class position (Wright 1989). The main task in this type of 
stratification research is thus to study the ways in which the pooling and sharing 
of labor and resources in the family influence the work position of individual 
men and women. 

In this review I primarily focus on the question of how to characterize the 
family's class or status. This is a central question in the line of research where 
the unit of analysis remains the family, that is, research about class analysis 
and class formation where "class structure" is the dependent variable. This 
research tradition is also where researchers have attempted to deal most directly 
with the problems that women's employment may create for the measurement 
of the family's class position. Research on the consequences of the family's 
class position for the individual's resources and work positions clearly also 
needs measures of the family's class. However, this body of research has not 
paid much explicit attention to the problem of measuring the family's status 
or class. 

In the next section, the focus is on the empirical evidence for the need to 
change the conventional approach. Two types of studies are reviewed. One 
type consists of studies of individual class "behavior" in the form of subjective 
class identification and voting preferences; the other type consists of studies 
of social mobility. In the subsequent section, I discuss proposals for alternative 
measures of the family's class position, measures that in various ways relax 
the assumptions of the conventional approach and include information about 
women's work position in the measurement of families’ class position. I 
conclude with a discussion of the implications of the research on the empirical 
adequacy of the conventional approach, and I propose an agenda for future 
research strategies. 


EMPIRICAL EVIDENCE AND THE CONVENTIONAL 
VIEW 


Two research strategies have been used to show that women's employment 
represents a serious challenge to the conventional approach. The first strategy 
is to test the null hypothesis that a conventional measure of family status 
explains as much of the variance in indicators of class behavior as do 
alternative measures. Hence, a rejection of the null hypothesis is a rejection of the 
conventional approach. The other strategy used to argue for or against the 
conventional approach is to compare the results of mobility analyses using 
different measures of class position. The conventional approach is again taken 
as the baseline, and the null hypothesis is that the conventional measure of the 
family’s class produces results that are similar to those found with alternative 
measures of family status. 


Class Identification and Voting 


Several studies have tested the hypothesis that the class identification and 
voting preferences of women and men are influenced not only by their own 
class position but also by that of their partner. Support for this hypothesis
can be taken as evidence of the inadequacy of the conventional way of mea- 
suring the family’s status by using information only about the (male) head of 
household. Studies of class identification provide mixed results, some in favor 
of the conventional approach, others not. Interpreted by researchers to be 
strongly in favor of the conventional approach is a recent study using data on 
class identification from eight different surveys, where it is shown that “the 
class identification of married women does usually have a stronger association 
with their husband’s class position than with their own” (Erikson & Goldthorpe 
1992b:99). A similar conclusion was reached with respect to Norway (Hernes 
& Knudsen 1987) and the United States, where “the class character of the 
wife’s job seems to have no effect on the class identification of either men or 
women” (Wright 1989:52, Hammond 1987). 

Other studies using data from Sweden and Norway have interpreted their 
results differently. Working-class identification was higher among working- 
class women married to men in the working-class than it was among working- 
class women married to middle-class men (Leiulfsrud & Woodward 1989:79). 
A similar difference was observed for working-class men, although the differ- 
ences typically are much smaller for men than women. In other words, the 
effect of the class position of the spouse was stronger for women than for men. 
Equally important, middle-class men married to working-class women were 
much less likely to identify with the working class than were middle-class 
women married to working-class men (Wright 1989). It should be noted that 
some of the data used in these studies also were included in the analysis by 
Erikson & Goldthorpe (1992b), where the opposite conclusion was reached. 

A comparison of class identification in the 1970s and 1980s in the United 
States suggests that support for the conventional view may have weakened 
over time as women remain in the labor force throughout most of their lives 
and families expect both husband and wife to be gainfully employed. In the 
1970s, women's class identification was influenced only by the husband's 
position, but in the 1980s both their own and their husband's position had a
significant influence (Davis & Robinson 1988). 

The main impression left by studies of subjective class identification is that 
married women's employment does make a difference, but that this difference 
is small, although possibly increasing. 

In studies of voting behavior the conventional approach also fares quite 
well. A British study of voting intentions showed that "a woman's own occupation
contributes no extra useful information once the social class of her
husband is determined" (Marshall et al 1988:72), and a similar conclusion was 
reached in an analysis of voting intentions in Denmark, West Germany, and 
Sweden (Erikson & Goldthorpe 1992b:Table 2). A considerably more complex 
picture is presented in a recent analysis of British data (De Graaf & Heath 
1992). The overall picture of the relationship between voting behavior and the 
partner's own class position was rather similar to that presented in earlier 
studies: The man's social class dominated the voting intentions of both part- 
ners. No support was found for Erikson's (1984) dominance hypothesis—there 
was no evidence that the partner with the dominant position determined the 
voting intentions of both partners—and no support was found for hypotheses 
about a trend toward greater influence of the woman's position. However, a 
more detailed analysis showed the existence of significant interaction effects. 
Women in the higher and lower service class and in blue-collar occupations 
were influenced more by their own class than by their partner's when they 
expressed their voting intentions. Women in the petty bourgeoisie and the 
lower white-collar occupations were more likely to vote according to their 
partner's class than their own. In sum, the conventional approach was perfectly 
acceptable as a way to determine the family's class position for some families, 
but for families in which the women worked in professional or semiprofes- 
sional jobs, or in blue-collar jobs, the conventional approach was less than 
satisfactory. Women in these classes behaved more like men in the sense that 
their voting behavior was influenced more by their own class position than by 
their partner's; the effect of their own position was three times greater than 
was the effect of their partner's class. 

On one level, the conclusion of this study supports the conventional ap- 
proach. For British couples taken as a group, the expectation of the conven- 
tional view is well supported by the data: both partners’ voting behavior is 
more closely associated with the man's class than with the woman's. The 
exceptions to the rule do, however, seem to be of importance because they 
suggest that under some circumstances, women's class position begins to 
matter more for their voting behavior. This in turn could mean that the con- 
ventional approach should be replaced by some other measure of the family's 
class position, but the study does not make clear how such a measure should 
be constructed. 




The results of the research on class identification and voting behavior add 
up to the following: consistent with the conventional view, most women’s 
class identification and voting behavior are more influenced by their husband’s 
class position than by their own class position. Contrary to the conventional 
view, women’s own class position often does matter a little, and for women 
in certain occupational classes it may matter a lot. There is then limited support 
for the hypothesis that women’s employment does make a difference for the 
class identification and voting behavior of husbands and wives; the question 
is whether this difference is large enough to justify a retreat from the conven- 
tional approach, that is, to reject the null hypothesis that the family's class 
position can be measured adequately by the class position of the male head of 
the household. 

With few exceptions, the performance of the conventional measure of the 
family's class has been assessed by testing whether the wife's class position 
adds significantly to the variance explained net of the husband's class (which, 
of course, is the conventional measure of the family's class). An alternative 
strategy has been to compare the strength of the association between the 
dependent variable and husband's and wife's social class. A much stronger 
association with the husband's class for both partners is then seen as support 
for the conventional view. There are two serious problems with these analytic 
strategies. First, they easily lead to different conclusions regarding the perfor- 
mance of the conventional approach, and, second, it is not clear what a rejection 
of the conventional view means; the alternative to the null hypothesis is not
well specified. Consider first the case where there is a significant net effect of 
the wife's class position on her voting intentions. This is contrary to the 
expectation under the conventional model; all that should matter is the family's
class position as measured by the husband's class. If the first analytic approach 
is taken, the fact that there is a significant net effect is seen as sufficient 
evidence to reject the conventional view: women's class position does matter 
(Davis & Robinson 1988, Leiulfsrud & Woodward 1989). If the second ana- 
lytic approach is taken, the identical result may easily lead to the conclusion 
that the conventional view is acceptable, because women's class position does 
not matter very much compared to men's, or the errors that are introduced by 
using the conventional measure are small (Erikson & Goldthorpe 1992b; 
Marshall et al 1988). 
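The way a single result can support both readings may be easier to see in a small numerical sketch. The Python fragment below is an illustration only, not the procedure of any study cited here: the data are invented toy values, the class labels "W" and "M" are placeholders, and the share of variance lying between groups (eta-squared) stands in for whatever model the original analyses used. The first strategy asks whether the wife's class adds anything beyond the husband's class; the second compares the size of the two associations.

```python
from collections import defaultdict
from statistics import mean


def variance_explained(outcomes, groups):
    """Share of the outcome variance that lies between groups (eta-squared)."""
    grand_mean = mean(outcomes)
    total_ss = sum((y - grand_mean) ** 2 for y in outcomes)
    by_group = defaultdict(list)
    for y, g in zip(outcomes, groups):
        by_group[g].append(y)
    between_ss = sum(
        len(ys) * (mean(ys) - grand_mean) ** 2 for ys in by_group.values()
    )
    return between_ss / total_ss


# Hypothetical toy data, invented for illustration only:
# (husband's class, wife's class, wife's score on some class-behavior scale)
rows = [
    ("W", "W", 8), ("W", "W", 8), ("W", "M", 7), ("W", "M", 7),
    ("M", "W", 3), ("M", "W", 3), ("M", "M", 2), ("M", "M", 2),
]
y = [r[2] for r in rows]

r2_husband = variance_explained(y, [r[0] for r in rows])
r2_wife = variance_explained(y, [r[1] for r in rows])
r2_both = variance_explained(y, [(r[0], r[1]) for r in rows])

# Strategy 1: does the wife's class add anything net of the husband's class?
increment = r2_both - r2_husband
# Strategy 2: which partner's class shows the stronger association?
husband_dominates = r2_husband > r2_wife

print(round(r2_husband, 3), round(r2_wife, 3), round(increment, 3), husband_dominates)
```

In these toy data the increment is greater than zero, which the first strategy would read as evidence against the conventional view, while the husband's class accounts for far more variance than the wife's, which the second strategy would read as support for it. A real analysis would also need a significance test for the increment, which is omitted here.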

Consider next the case where there is agreement on a rejection of the 
conventional view. This could be based, for example, on the finding that the 
effect of own class position is more important for both men's and women's
voting behavior in selected classes (de Graaf & Heath 1992). Undoubtedly this 
is an important conclusion, but it is not at all clear what the preferred alternative 
is. The decision to reject the conventional approach does not rest on a com- 
parison of two alternative measures of the family's class position, but rather 
on an evaluation of predictions under the null hypothesis that the conventional
view holds. 


Studies of Intergenerational Class Mobility 


The study of intergenerational class mobility is the study of the transmission 
of advantage and disadvantage from one generation to the next. The proper 
unit of analysis then is the family, since it is the study of mobility from the 
class of the family of origin to the class of one's own family (Ganzeboom et 
al 1991). It should be obvious that the measurement of the family’s class 
position is a very important task for the study of intergenerational mobility, 
and that there is a strong possibility for bias or misleading results if the family's 
class position is poorly measured. 

The defense of the conventional approach to the analysis of intergenerational 
mobility has taken two forms: (i) a rejection on theoretical grounds of proposals 
to include information about both partners in a measure of the family's class 
(Goldthorpe 1983), and (ii) demonstration of the empirical adequacy of the 
conventional approach (Erikson & Goldthorpe 1992a: Ch. 7, Goldthorpe & 
Payne 1986). 

Proposals to include information about both partners in a measure of the 
family's class position have been rejected because the family's class position 
might change every time one of the spouses changes jobs or moves in or out 
of the labor force. A so-called joint classification measure thus “greatly accentuates
problems of defining class boundaries and tends to produce rates of
class mobility that we would regard as spuriously high” (Erikson & Goldthorpe 
1992a:238). This is a curious criticism. The rationale for a joint classification 
measure has been that a family’s class position in fact is influenced by the 
work position of both husband and wife; that is, the family’s material conditions,
style of life, interests and attitudes, etc., are indeed thought to be affected
by the wife leaving or entering the labor force, or the husband becoming 
unemployed, or either of them moving to a job in a different class. These 
expectations may not hold true (as we shall see later), but if they do, then the 
mobility that is “introduced” by the joint classification measure is true class 
mobility and, as such, may have serious consequences for the stability of
classes and the formation of class consciousness and class action. The weak- 
ness of this criticism may best be illustrated by showing that the modified 
conventional approach may have similar consequences. This will be so if
divorce rates are high. In that case, the class mobility over the life course will
be high, especially for women, who would be in the class of their spouse as
long as they are married and in their own class when they are unmarried.
Again, it is difficult not to see these changes as true mobility, provided the
measure of the family's class position is adequate.

Footnote: As noted earlier, it is important to distinguish this type of research from research that examines
the relationship between an individual’s class position and the class of the family of origin, as has
been done in status attainment research or in studies of mobility comparing men’s or women’s class
position with that of their father (or parents). In these studies the question is how the family of origin
affects men’s and women’s own class position.

The empirical adequacy of conventional mobility studies is typically de- 
fended by showing that studies based on the class mobility of men are not 
misleading, or to put it differently, that the inclusion of information about 
women's class position makes no difference. A typical example of this strat- 
egy is the analysis of women's class mobility by Erikson & Goldthorpe 
(1992a). Two questions are addressed, namely, (i) whether the structure of
men's intergenerational mobility is similar to women's marital mobility, and 
(ii) whether the inclusion of the class position of female heads of households 
changes the structure of the mobility table. The conventional view assumes 
that the answer to the first of these questions is yes, and the answer to the 
second is no. This is not what Erikson & Goldthorpe found, although they do 
their best to downplay the importance of their findings. 

Several researchers have suggested that women are more mobile through 
marriage than men are through employment, or that women are less constrained 
in their mobility from one family class to another than men are (Chase 1975, 
Heath 1981). If that is the case, then the conventional approach will underes- 
timate the openness of the mobility regime because it ignores women's expe- 
riences. Erikson & Goldthorpe did find that women experienced more mobility 
away from their class origin than men (Erikson & Goldthorpe 1992a:255). 
They also found that in five countries out of ten the patterns of fluidity were 
different for men’s occupational mobility and women’s marital mobility, and 
that the association between origins and destinations was somewhat smaller 
for women than men (p. 259). The authors found these differences sufficiently
small to conclude that “if we know how men of a given class origin have
themselves distributed within the class structure in the course of their employment,
we can predict, with no great inaccuracy, how their sisters will have
been distributed through marriage” (p. 261).

This optimism with respect to the conventional approach is more difficult 
to maintain in view of the results of a unique analysis of “complete” mobility 
tables, that is, tables representative of all households. The class origin was 
measured by the father’s class position, but the class destination was determined
on the basis of the dominance principle (Erikson 1984). This means
that it is the spouse with the dominant class position who determines the
family’s class position. The analysis showed that the male-only mobility table
underestimated the chances of upward mobility, and that, in three of the four
countries for which this analysis was done, the “pattern of social fluidity
underlying the complete tables differs to some significant degree from that we
have modelled in our ‘men-only’ tables” (Erikson & Goldthorpe 1992a:
271-72). Specifically, the conventional approach seems to underestimate the
amount of fluidity. The source of this difference is apparently not the inclusion
of single women or women with a dominant position, but rather the fact that
the effects of class origins in the petty bourgeoisie and farming classes differ
for women and men. Similar results are reported for the United States, where
it was concluded that conventional analyses have overestimated the amount
and direction of intergenerational association in the mobility regime
(Aschaffenburg 1992).

Footnote: It is important again to note that the question is not whether the inclusion of women’s class
position makes a difference in studies of the class structure understood as positions in the system of
production or the labor market, nor is the question whether men’s mobility chances are influenced
by women’s peculiar location in the labor market. The answer to both of these questions is that the
inclusion of women’s class position is very important for these types of studies.

These results are not consistent with the conventional approach, and they 
would seem to provide the clearest evidence so far for the need to change it. 
Nonetheless, Erikson & Goldthorpe are not willing to draw this conclusion, 
because they do not deem the differences as being of "any great quantitative 
importance" (Erikson & Goldthorpe 1992a:275). 

The empirical support for the conventional view is difficult to evaluate. It 
would appear to be rather strong when the dependent variables are measures 
of “class behavior,” such as subjective class identification or voting intentions. 
The available evidence here suggests that a majority of women are influenced 
more by their husband’s class than by their own class, and that men are not 
likely to be much influenced by their partner’s class. Women’s own class does 
have some effect on class identification and voting preferences, however. There 
is also evidence that in some families women are as much influenced by their 
own class as men are; this in turn means that the class position of these families 
is not well measured by using information only about the male head of house- 
hold. Results based on analyses of mobility tables also to some extent support 
the conventional view, i.e., basing mobility studies only on men’s mobility 
does not seem to introduce very strong biases or errors. Nevertheless, the most 
recent evidence does suggest that the male-only mobility table presents a 
picture of the intergenerational mobility process that is systematically different 
from the one resulting from an analysis that is representative of all households 
(i.e. includes female heads of households) and that allows the family’s class 
position to be determined by the dominant class position in the family. 

In light of these findings, it seems clear that there is a need for developing 
measures of the family’s class position that somehow take women’s employ- 
ment into account. Here follows a review of the recent literature on the 
determination of the family’s class position. 
The basic problem in the measurement of the family's class position is that of
combining different information about the work position of individual family 
members into one measure of the family's social class. Two types of measures 
have been proposed, measures of class based on the head of the household's 
own class position, and measures using data on both partners in a family. 


Head of Household Measures 


This class of approaches is characterized by the assumption that a family's class
position is determined by the occupational position of only one person in the 
family, namely, the head of the household. The conventional view chooses a 
particularly simple solution to this question by using the occupational position 
of the male head of the household as the single indicator of the family's social 
class position, and the simplest variation on the conventional approach is that 
if there is no male in the household then the occupational position of the female
head designates the family's class position. These two ways to measure a 
family's social standing without doubt have dominated research on social 
mobility and status attainment both in the United States and in Europe from 
the 1950s until today (e.g. Blau & Duncan 1967, Featherman & Hauser 1978, 
Erikson & Goldthorpe 1992a; Ganzeboom et al 1992, Kurz & Mueller 1987). 

A further modification lets the head of the household be the person with the 
dominant occupational position (Erikson 1984). In Erikson's dominance model, 
the head of the household is the person with the strongest attachment to the labor 
force and the occupational position that requires the highest qualifications. 
Should partners have positions requiring a similar level of qualifications, then 
nonmanual positions are assumed to dominate manual positions, and the self-employed
to dominate the employed. In general, the dominant occupational position
is the position that can be assumed to be more important in terms of defining or 
influencing the family's lifestyle and interests through the income or status or 
prestige it brings, or through the style of life it imposes on the family (self- 
employment is important in this respect). The dominance model then allows for 
the possibility that the woman may have an occupational position which makes 
her the head of the household, and thus the person who has the greatest influence
on the family's lifestyle, consumption patterns, interests, etc. Moving from the 
conventional to the modified conventional to the dominance model can be 
conceived of as a move from a patriarchal view to an empirical approach as to 
who is the head of the household. In the patriarchal view, the household head is 
always a man. The first modification allows a woman to be the head if there is no 
male in her family. The second modification allows a woman to be the head if she 
occupies a position which can be seen as having a greater influence on the family 
than that of her partner.
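The three head-of-household rules described above can be written out as a short sketch. The Python fragment below is illustrative only. The numeric scores for labor-force attachment and qualification level are hypothetical, and the strict ordering of the criteria (attachment first, then qualifications, then nonmanual over manual, then self-employed over employed) is an assumption made for the sketch; the dominance model as described here does not say how attachment and qualifications are weighed against each other.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Partner:
    sex: str             # "M" or "F"
    occ_class: str       # label of the partner's occupational class
    attachment: int      # hypothetical score: higher = stronger labor-force attachment
    qualification: int   # hypothetical score: higher = higher required qualifications
    nonmanual: bool      # True for nonmanual positions
    self_employed: bool  # True for the self-employed


def conventional(partners: List[Partner]) -> Optional[str]:
    """Patriarchal rule: the head is always a man; undefined if there is none."""
    men = [p for p in partners if p.sex == "M"]
    return men[0].occ_class if men else None


def modified_conventional(partners: List[Partner]) -> Optional[str]:
    """As above, but a woman is the head if there is no male in the household."""
    men = [p for p in partners if p.sex == "M"]
    if men:
        return men[0].occ_class
    women = [p for p in partners if p.sex == "F"]
    return women[0].occ_class if women else None


def dominance(partners: List[Partner]) -> Optional[str]:
    """Head is the partner with the dominant position, regardless of gender.

    Assumed lexicographic ordering; on a complete tie the first-listed
    partner is returned.
    """
    if not partners:
        return None
    head = max(
        partners,
        key=lambda p: (p.attachment, p.qualification, p.nonmanual, p.self_employed),
    )
    return head.occ_class


# Hypothetical couple: both work full time, the wife's job requires more qualifications.
husband = Partner("M", "working", attachment=2, qualification=1,
                  nonmanual=False, self_employed=False)
wife = Partner("F", "service", attachment=2, qualification=3,
               nonmanual=True, self_employed=False)

print(conventional([husband, wife]), dominance([husband, wife]))
```

For this hypothetical couple the conventional rules assign the family to the husband's class, while the dominance rule assigns it to the wife's. For a woman living without a male partner, the strict conventional rule returns no class at all, whereas the other two rules use her own position.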
The dominance model has not gained wide usage in the research community,
possibly because so few women occupy the dominant position (Erikson & 
Goldthorpe 1992a:267). A classification based on the dominance model will 
then typically not differ very much from the classification resulting from the 
operationally much simpler modified conventional approach. Conceptually the 
dominance model differs from the conventional approach in one fundamental 
respect: Occupational position is assumed to be more important than gender 
in the determination of a family's class position. This has the important con- 
sequence that a woman's class position under the dominance model does not 
necessarily change with her marital status, as it does by definition under the 
conventional approach. 


Joint Classifications 


The joint classification approach to measuring a family's social class is still 
in its infancy, and it seems fair to say that little consensus exists on either the 
best way to combine information about both husband's and wife's occupational 
class into a joint measure of the family's class position, or on the wisdom of 
doing so, as discussed previously (Goldthorpe 1983, Erikson & Goldthorpe 
1992a).

The simplest and most common method is to use both husband's and wife's 
occupational position as variables in statistical models of some outcome vari- 
able, such as class identification (Davis & Robinson 1988, Hammond 1987, 
Wright 1989). This approach has been used in research where the family's 
social class position is used as an independent variable. It has been most
common to introduce husband's and wife's class as additive variables, but 
clearly it should be possible with interaction effects to identify combinations 
of classes that may be of particular interest, or to test specific hypotheses about 
the effects of class heterogeneity within families. Well-known examples of the 
former strategy are status attainment studies examining the effects of both 
father's and mother's occupation and level of education on children's educa- 
tional and occupational attainment (Boyd 1982, Hayes 1993, Marini 1980, 
Rosenfeld 1978, Sewell et al 1980). A rare example of the latter possibility is 
found in a study which used a so-called diagonal reference model for the 
analysis of mobility tables (Sobel 1981) to analyze husbands' and wives' 
voting behavior (de Graaf & Heath 1992). It was hypothesized that class-ho- 
mogeneous families represent the "core" of each social class, and that this core 
“might be expected to establish common norms and values which set the 
reference for other, heterogamous couples" (de Graaf & Heath 1992:313). 
Deviations from this basic model in turn allowed the researchers to test six 
different hypotheses about the "best" way to measure the family's class posi- 
tion. One corresponded to the conventional approach, another to the dominance 
model, and a third to a model where individuals' voting behavior is mostly 
influenced by their own class position (the individual approach). The fourth
hypothesis was that there has been a trend towards individualism, and the fifth 
was that this trend has been more true of women than men. Finally, the sixth 
hypothesis was that the trend has been not toward individualism, but toward 
sharing, that is, toward a situation where both men's and women's voting 
behavior is influenced by their own position as well as by their spouse's class 
position. This research is an example of a promising way to model the influence 
of men's and women's work position on the family's class position. Unfortunately,
most studies so far have relied on simple additive models where the
main purpose seems to have been to establish whether women's employment 
"matters" in some unspecified way. 

Some attempts have been made to construct a single measure of the family's 
class position based on the joint classification of husband's and wife's class, 
but the attempts remain few. Some have seen nontraditional cross-class fam- 
ilies, that is, families where the wife has the higher status, as particularly 
important because not only are they cross-class families but also families with 
nontraditional gender roles (McRae 1986); others have argued that cross-class 
families, where one partner is in the working class while the other is not, 
represent a group of families where the internal contradictions should be 
especially strong (Leiulfsrud & Woodward 1989:76). 

It has also been suggested that it is the degree of heterogeneity or difference 
between spouses' class position that is the dimension of cross-class families 
that should be measured. Using Wright's class schema, Graetz (1991) subdi- 
vided the cross-classification of husband's and wife's class into four subsets 
of cells to describe dual-earner families as class-homogeneous, class-compatible,
class-mixed, or class-opposing. Diagonal cells were the class-homogeneous
cases, and the class-compatible cases were found in the off-diagonal cells
within each of the three broad class categories: employers and petty bourgeois, 
middle class, and working class. Families where one partner had a job in the 
working class and the other had a job in the employer-petty bourgeois class 
were defined as class-opposing families. This left class-mixed families as 
combinations of middle-class and working-class jobs or of middle-class and 
employer-petty bourgeois jobs. 
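The fourfold typology just described amounts to a simple decision rule over the cross-classification of the two partners' classes. The sketch below expresses that rule in Python. The detailed class labels and their assignment to the three broad categories are placeholders chosen for illustration and should not be read as the cell definitions actually used by Graetz (1991).

```python
EMPLOYER_PB = "employer/petty bourgeois"
MIDDLE = "middle class"
WORKING = "working class"

# Hypothetical mapping from detailed class labels to the three broad categories.
BROAD = {
    "employer": EMPLOYER_PB,
    "petty bourgeois": EMPLOYER_PB,
    "manager": MIDDLE,
    "expert": MIDDLE,
    "worker": WORKING,
}


def family_type(husband_class: str, wife_class: str) -> str:
    """Classify a dual-earner family by the degree of class heterogeneity."""
    if husband_class == wife_class:
        return "class-homogeneous"   # diagonal cell
    h_broad, w_broad = BROAD[husband_class], BROAD[wife_class]
    if h_broad == w_broad:
        return "class-compatible"    # off-diagonal, same broad category
    if {h_broad, w_broad} == {WORKING, EMPLOYER_PB}:
        return "class-opposing"      # working class with employer/petty bourgeois
    return "class-mixed"             # middle class with either of the other two


print(family_type("manager", "worker"))
```

Because the rule looks only at the unordered pair of broad categories, it treats a family with a working-class husband and an employer wife in the same way as the reverse combination.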

These different proposals to classify cross-class families all focus on dual- 
earner households. The class position of families with only one person in the 
labor market, or where both husband and wife have work positions in the same 
class, is usually not considered particularly problematic. These two types of 
families are both classified as class-homogeneous, and their class position is 
simply the class in which one or both partners work. Thus, the presence or 
absence of a housewife typically is assumed not to affect the class position of 
a family (Erikson 1984, Graetz 1991, Leiulfsrud & Woodward 1989, Wright 
1989); a worker married to a worker has the same class position as a worker 
married to a housewife. This assumption is surprising since a major reason for
developing new measures of the class location of families is the hypothesis 
that women's employment makes a difference for the family's lifestyle, inter- 
ests, etc. 

How common are class-heterogeneous families? This will clearly depend 
on the number of dual-earner families, and on gender differences in occupa- 
tional position. It also depends strongly on the number of class categories used, 
since the proportion of off-diagonal cells in a cross-classification will be 
higher, the more detailed the two marginal classifications are. A study in 
England that defined heterogeneous families as those where one partner had 
a manual position and the other a nonmanual position (Britten & Heath 1983, 
Heath & Britten 1984) found that 20% of all families (36% of dual-earner 
families) according to this criterion were cross-class families. In a comparison 
of the United States and Sweden, Wright (1989) found that 18% of American
families and 26% of Swedish families were heterogeneous with respect to the
individual class locations of the husband and wife. This study employed a class
scheme with three categories, namely, the self-employed and the petty bourgeoisie,
the middle class, and the working class. A comparative study using
Wright's first class scheme (five categories) found that the percentage of
class-heterogeneous families ranged from 39% in Finland and the United Kingdom
to 44% in Sweden and 46% in Norway. Limiting cross-class families to
those where one partner was a member of the working class, the figures were
reduced to 28-29% for Finland and the United Kingdom, to 33% for Sweden,
and 34% for Norway. Data for Australian families classifying families by the
degree of class heterogeneity showed that 17% were class-homogeneous, 16%
class-compatible, 16% class-mixed, and only 3% class-opposing; 46% had 
only one earner (Graetz 1991: Table 3). Hence, there is a great deal of variation 
in the estimates of the frequency of class-heterogeneous families. This is not 
surprising, given that each study defines cross-class families in a different way. 
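The dependence of these estimates on the number of class categories can be shown with a small calculation. In a square table with k categories, k(k - 1) of the k * k cells lie off the diagonal, so the share of off-diagonal cells is 1 - 1/k. The Python sketch below computes that share and also shows, for a table of invented counts (not data from any of the studies cited), how merging two categories lowers the measured proportion of class-heterogeneous families.

```python
def offdiagonal_cell_share(k):
    """Share of cells off the diagonal in a k-by-k cross-classification."""
    return (k * k - k) / (k * k)


def heterogeneous_share(table):
    """Share of families whose partners fall in different classes."""
    total = sum(sum(row) for row in table)
    diagonal = sum(table[i][i] for i in range(len(table)))
    return (total - diagonal) / total


def collapse(table, groups):
    """Merge categories; groups[i] is the new index of old category i."""
    k = max(groups) + 1
    merged = [[0] * k for _ in range(k)]
    for i, row in enumerate(table):
        for j, count in enumerate(row):
            merged[groups[i]][groups[j]] += count
    return merged


# Hypothetical counts of couples: rows = husband's class, columns = wife's class.
three_class = [
    [30, 5, 5],
    [5, 20, 5],
    [5, 5, 20],
]
# Merge the first two classes into one, leaving a two-class scheme.
two_class = collapse(three_class, [0, 0, 1])

print(offdiagonal_cell_share(3), offdiagonal_cell_share(5))
print(heterogeneous_share(three_class), heterogeneous_share(two_class))
```

With these invented counts, the same couples yield a smaller share of cross-class families under the two-class scheme than under the three-class scheme, simply because couples who differed only between the two merged classes now fall on the diagonal.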

There is no doubt that class-heterogeneous families exist, and it is also clear 
that women in some of these families have the dominant work position. The 
question is what the implications of their existence are for the determination 
of a family's class position. If the conventional approach should be abandoned, 
what should it then be replaced by? The alternatives are the dominance ap- 
proach and the joint classification approach. The choice between them can be 
made on conceptual grounds, or on empirical grounds. An example of the 
former, discussed previously, is the rejection of joint-classification measures 
on the grounds that they will produce “too much" mobility (Goldthorpe 1983). 
If the choice between measures is made on empirical grounds, the preferred 
measure will be that which shows the strongest association with indicators of 
the family's material conditions, lifestyle, etc. 

It is surprising, given the heated debate over alternatives to the conventional 
approach and the frequent references to these class correlates, that there are
very few studies comparing the explanatory power of different measures of 
the family's class position. One exception is a study that compared six different 
measures of family class using data from the Swedish Level of Living surveys 
(Erikson 1984). Four of these measures are of particular interest here, given 
the previous discussion: (i) a measure of the modified conventional type, (ii) 
a measure based on the dominance principle, (iii) a measure based on the 
principle of joint classification, and (iv) a measure where the family's class 
position is indexed by the respondent's class position, whether the respondent is
male or female. These four indexes of class were correlated with ten measures of the
family's “market position"; these included family income, size of the dwelling, 
house ownership, and a measure of the extent of cultural activity. Compared 
to the conventional measure, the dominance measure of family status explained 
slightly more of the variance in all but one of ten indicators. The joint-classi- 
fication measure explained somewhat less variance in eight, and somewhat 
more in two indicators, and the individual measure was consistently out- 
performed by the conventional (and all other) measures. It should be noted 
that none of the class measures explained very much of the variance in most 
indicators of the family's living conditions and life style, the exceptions being 
family income, where the variance explained ranged from 19.9% to 25.3%, 
and cultural activity, where it ranged from 15.3% to 17.2%. The conclusion of 
this study was that the conventional approach was outperformed by the dom- 
inance approach: "It is profitable to take the occupations of both husband and 
wife into account" (Erikson 1984:512), but this should be done not by devel- 
oping a joint classification measure, but by letting the family's status be 
measured by the dominant class position in the family. 
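The empirical criterion described above (prefer the family-class measure that accounts for the most variance in an indicator of living conditions) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the toy data are invented and are not from Erikson (1984), the class labels and their rank order are hypothetical, and the dominance rule is reduced to "take the higher-ranked of the two spouses' classes," which is simpler than the rule used in the study. Variance explained is computed as eta squared, the between-class share of the total sum of squares.

```python
from collections import defaultdict

# Hypothetical rank order of class labels (higher = more advantaged).
# This ordering is an assumption for illustration only.
CLASS_RANK = {"I": 3, "III": 2, "VI": 1}


def variance_explained(classes, outcome):
    """Eta squared: share of outcome variance lying between class categories."""
    n = len(outcome)
    grand_mean = sum(outcome) / n
    total_ss = sum((y - grand_mean) ** 2 for y in outcome)

    # Group the outcome values by class category.
    groups = defaultdict(list)
    for c, y in zip(classes, outcome):
        groups[c].append(y)

    # Between-group sum of squares, weighted by group size.
    between_ss = sum(
        len(ys) * (sum(ys) / len(ys) - grand_mean) ** 2 for ys in groups.values()
    )
    return between_ss / total_ss


def dominant_class(husband, wife):
    """Simplified dominance rule: the higher-ranked of the two class positions."""
    return max(husband, wife, key=CLASS_RANK.get)


# Invented toy data: one entry per family.
husband = ["I", "I", "III", "III", "VI", "VI"]
wife = ["III", "I", "I", "III", "VI", "III"]
income = [50, 60, 55, 35, 20, 30]

conventional = husband  # family class indexed by the male head
dominance = [dominant_class(h, w) for h, w in zip(husband, wife)]

for name, measure in [("conventional", conventional), ("dominance", dominance)]:
    print(f"{name}: {variance_explained(measure, income):.3f}")
```

With real survey data the same comparison would be repeated for each indicator (income, dwelling size, and so on), and the measures would be ranked by how often each explains the most variance.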

This is, of course, only one study, and it is not clear that the results can be 
generalized. More research on these issues is definitely needed. 


CONCLUSION 


The question of how to incorporate women in class analysis and stratification 
research has been the topic of heated debate and much controversy in recent 
decades. I have suggested that disagreements about the "proper" unit of analysis 
(the family or the individual) are really disagreements about what the 
research concern should be in class analysis or stratification studies. The 
distinction between the class position of individual men and women, and the 
class position of families is crucial, and it has been made more necessary by 
the changes that have taken place regarding women's economic roles. These 
changes require that women be included in the study of individual careers and 
in class analyses that see classes as “empty places" in the system of production 
or in the labor market. However, while the family and its class position remain 
an important area of stratification research, women's new economic roles have 
called into question the conventional approach to the study of the family's 
class position which uses the class position of the male head as the measure 
of the family's class position. 

The empirical evidence is to some extent in favor of the conventional 
approach, and its use in empirical research probably has not resulted in serious 
misrepresentations. Nonetheless, there seem to be sufficient grounds for rec- 
ommending a change in the procedure for determining a family's class position 
in future studies. Research on subjective class identification and voting behav- 
ior has shown that women's own class position does influence their attitudes, 
and in some families, women's own class position has a stronger effect on 
them than does their husband's class position. Research on intergenerational 
mobility has also shown that the conventional analysis of male-only tables to 
represent the whole population underestimates the degree of openness in the 
mobility regime. The bias is not large, but it is nonetheless there. 

The inclusion of women's class position in the determination of the family's 
status will make an empirical problem out of women's employment. Instead 
of assuming that women's employment poses no empirical problem, as the 
conventional view would have it, this question is better left open for empirical 
study. For then it becomes possible to determine how much women's class 
position matters for their family and to examine the conditions under which 
women's work position becomes more important than that of their partner. 
It also allows one to study the question of why women's employment in some 
families (and possibly in many) does not have any influence on the family's 
lifestyle, on its material conditions, or the interests and attitudes of its members. 
In other words, a replacement of the conventional approach to determining the 
family's class position will make it possible to address many questions that 
are central to our understanding of the class position of families. It will be up 
to future research to develop these new approaches. 
Abstract 


Oppositions and deadly conflicts among ethnic collectivities are important 
around the world. Ethnies (our term for ethnic groups) also strongly affect 
interstate relations. Both interethnic and ethnic-state conflicts tend to be severe, 
protracted, and intractable. At the extremes, the stakes are total: survival versus 
genocide. 

Competition and rivalry for individualized economic and political goods are 
important, but the most intense conflicts are to be expected when the stakes 
are collective goods, including categorical claims to prestige and political 
authority. 

States are major actors in creating, accentuating, or diminishing ethnic 
identities. States are both arenas of rivalry and conflict and resources for ethnic 
mobilization and counter-mobilization. 

Because both ethnies and states are diverse, careful specification is required 
for fruitful analysis. The same dictum applies for the diverse types of opposi- 
tions and of conflicts. 

Ethnic conflicts arise from complex combinations of ethnic strength, class, 
inequality, political opportunity, mobilization resources, interdependence, and 
international interventions. Frequent but nonviolent protests, for example, are 
most likely by organized collectivities with substantial resources, operating in 
relatively open political systems. International aid to parties in domestic con- 
flicts appears to prolong and intensify ethnic struggles. 

Research in this field contends with many difficulties, and one-sided theories 
do not fare well. Yet abundant descriptive materials are available, statistical 
techniques are improving, conceptual clarification continues, and substantive 
knowledge does accumulate. Accordingly, there is hope for better understand- 
ing of some of the most destructive and tragic conflicts of our times. 


INTRODUCTION 


The current significance of this topic is not in doubt. First, the world-wide 
prevalence of ethnic diversity is indicated by the presence of about 6000 
languages (Grimes 1988) and somewhere between 900 (Murdock 1967) and 
1600 (Levinson 1991—1993) major cultural groupings. For example, there were 
about 130 officially recognized "nationalities" in the former Soviet Union. 
Resurgent ethnic movements have appeared in many Western countries. Sec- 
ond, of the 183 states in the United Nations (as of June 1993), only a handful 
are ethnically homogeneous; multiethnicity is the rule. Third, in a warring 
world (Brogan 1990), ethnic-related conflict is frequent and often deadly. One 
half of the world’s states have experienced significant ethnic conflict since 
World War II. About 80% of deaths in warfare during that period have been 
internal to national states (Russett & Starr 1989:171), and much of that total 
has come from collective ethnic violence. Estimates of the number of deaths 
attributable to ethnic violence since 1945 vary widely; a minimum estimate 
of 11 million has been given by Topor (1992), but an upper figure of 20 million 
fatalities does not appear to be an excessive guess. In the ultimate case of 
genocide, as Leo Kuper (1985:161) has noted, the majority of deaths in do- 
mestic genocides result from struggles for power by ethnic, racial, or religious 
groups. (Of the vast literature on genocide we mention here only Hovannisian 
1986, Conquest 1990, and IL Horowitz 1992.) 

Over the past quarter-century scholars have paid rapidly increasing (even if 
belated) attention to these remarkable phenomena. Ethnopolitics, including 
nationalistic developments, is a crucial global force (cf Ragin 1983:1317, 
Moynihan 1993). Prior reviews of research on ethnicity have attributed this 
growing interest to the near-universality of multiethnic states, the persistence 
of strong ethnies in complex modern societies, ideological and policy disagree- 
ments, and the prevalence of severe ethnic conflicts (Olzak 1983, Yinger 1985). 
Meanwhile, at long last, military strategists are paying close attention to 
terrorism (Gibbs 1989) and so-called low-intensity conflicts, which often are 
ethnically based (cf Schultz 1991, Hoffman 1992). 

Because it has become quite impossible in this field to "review all the 
literature," this chapter has been forced to omit the enormous assemblage of 
data and interpretations dealing with religious fundamentalism, e.g. the six 
volumes emerging from The Fundamentalism Project of the American Acad- 
emy of Arts and Sciences (cf Marty & Appleby 1993a,b). Similarly we give 
only passing mention to many important historical studies of particular soci- 
eties (e.g. Suny 1985). Prior reviews make it possible to omit a survey of recent 
literature on ethnicity in the United States: Williams (1975, 1977, 1978), Porter 
& Washington (1979), Lambert (1981), Willie & Cunnigen (1981), Olzak 
(1983), Yinger (1985), Longshore & Prager (1985), Portes & Truelove (1987), 
Jaynes & Williams (1989). Finally, to observe space limits we must omit the 
large literature on resolution and management of ethnic conflicts. 

Our survey is illustrative rather than comprehensive. The central problem 
is how and under what circumstances interactions among ethnic collectivities 
produce strong oppositions and overt conflicts (Rule 1988:9-10). Much of the 
relevant research has been done by nonsociologists; we happily include that 
work wherever the content is suitably sociological. 

Before entering upon the main task, however, we must briefly set the review 
in a world-context and then examine the central concepts that are essential for 
the later discussion. 


THE WORLD CONTEXT 


In the last decade of this century, the world is increasingly interdependent 
economically, politically, and culturally. Transnational and international 
organizations have become more and more important. Meanwhile national states 
multiply, armaments increase to unprecedented levels, and political instability is 
endemic. At the same time, economic inequalities among countries are great 
and are increasing. Ethnic, religious, and class oppositions place severe stresses 
on weak states. The heritage of the great wave of decolonization through the 
1960s (Boswell 1989, Strang 1990) was a new population of multiethnic states, 
racked by domestic conflict. 

A heavily armed and politically unstable universe of sovereign states regu- 
larly produces some 30 or more wars per year. As of 1992 about 40 violent 
collective conflicts were being fought in 29 countries (Hoffman 1992:26). 

Ethnic conflicts since World War II have been facilitated by the rise in the 
number of new multiethnic states, by state-building activities, increased re- 
sources for mobilization, mass communication, diffusion of ideologies—and 
by external interventions. These developments have helped to instigate ethnic 
conflicts over control of central polities, conflicts over cross-national claims 
to territory, and struggles over autonomy or secession (Nagel 1980:280—82). 
The primary conditions thus are ethnic distinctiveness, geographic concentra- 
tion, interethnic inequality, "alien" state penetration (and fear of exclusion), 
ethnic organization, and external support. 

Ethnic divisions and ethnic-state relations are created, accentuated, reduced, 
or destroyed by the increasing internationalization of political economies, both 
in the division of labor and in the internationalization of state security (Enloe 
1980, 1986). 
Rapid population growth characterizes the less-industrialized world regions 
where there is great poverty and economic inequality within fragile states. 
World-wide increases in highly destructive weaponry accompany extensive 
militarization. And the (partial) end of the Cold War has created new interna- 
tional uncertainties and has released many formerly suppressed domestic con- 
flicts. 

The enormous flows of international migrants continually reshape ethnic 
relations. The United Nations estimates that in 1993 as many as 100 million 
individuals live outside their country of birth or citizenship; as many as 18 
million are refugees. Numerous guest-workers and other migrating workers 
elicit tensions and some violence in many places, e.g. Germany, France, the 
United States. (We note that Frankfurt is the third largest Turkish city, behind 
Istanbul and Ankara.) 

The structure of intrasocietal cleavages and the history of relationships 
among social formations—ethnies, classes, regions—constitute a global con- 
text of constraints upon and opportunities for collective action. Within such 
contexts, purposive actions by ethnic entrepreneurs and political leaders influ- 
ence subsequent events, leading toward or away from ethnic conflict. Accord- 
ingly, a major dimension of ethnic conflict or cooperation, world-wide, is the 
relation of ethnies to territorial states that claim unitary sovereignty over a 
territory and its population. 


CONCEPTS AND THEORIES 


Problems of definitions just will not go away, so controversies continue re- 
garding how best to characterize such terms as ethny, ethnic group, competi- 
tion, rivalry, conflict, solidarity, mobilization, nation, nationalism, state. Many 
of these terms have been defined and discussed in prior volumes of the Annual 
Review of Sociology, e.g. Williams 1975, Hirschman 1983, Olzak 1983, Yinger 
1985. Accordingly, this chapter treats only the most salient recent discussions. 


Structural Units 


The awkward and potentially misleading term "ethnic group" is so deeply 
entrenched in the professional literature that it may be folly to resist it. Yet 
there is the better word, “ethny,” proposed by van den Berghe (1981:22, cf 
Dutter 1990) and used routinely by AD Smith (1981); we shall use it here 
whenever convenient. The term captures the core idea: "The prototypical ethny 
is ... a descent group bounded socially by inbreeding and spatially by territory" 
(van den Berghe 1981:24). From this core, more extended ethnies develop by 
preferential endogamy, extended nepotism, fictive kinship, descent myths, and 
extension into large territories. At one extreme is the small, cohesive, closed 
local ethny; at the other, the vastly extended "symbolic" boundaries of modern 
ethnic identities (Yinger 1985:161). The moral: there are many different kinds 
of ethnies; analysis needs to make specific distinctions. Numerous concrete 
types of ethnic collectivities obviously exist: indigenous native populations, 
refugees, guest workers, ethno-territorial groupings, diaspora, ethnoclasses, 
and so on (Thompson & Rudolph 1986:29-36). "Race" is a special case of 
ethnicity. 

Our editorial comment can be brief: let us dispense with arguments as to 
what definition is "correct," recognize the inherent diversity, and move on. It 
is essential only to recognize that ethnicity is qualitatively different from such 
markers as ideology or occupation (Dutter 1990:312). 

Scholarly controversies are endemic concerning the relationships between 
class and ethny. The definitional problems do seem open to clarification. Social 
stratification refers to inequalities in the distribution of any scarce value. A 
social class is an aggregate of social units similarly positioned with relation 
to any or all of such scarce values, e.g. control of the means of production 
(van den Berghe 1983:222). To the extent that such aggregates are marked by 
special densities of interaction and by exclusion of nonmembers, they become 
collectivities. Collectivities defined by birth and cultural or physical distinc- 
tiveness are ethnies—which may or may not be ranked, and may or may not 
be internally stratified. A ranked collectivity enforcing endogamy becomes 
over time an ethnoclass or caste. Large ethnies typically are both ranked and 
internally stratified. 

How is ethnicity related to "nation" and "nationality"? A useful character- 
ization: “A nation (but see Gellner 1983) is a politically conscious ethny, 
claiming statehood rights on the basis of common ethnicity" (van den Berghe 
1983:222, cf Smith 1989:342). Nationalism is an ideological movement in 
support of a nation. Note, however, that the term "nationalism" is used to refer 
to two quite different conceptions: (i) identification with and loyalty to the 
state structure, regardless of ethnic composition of the population (American, 
Argentinian), (ii) identification with and loyalty to an ethnic/religious "nation- 
ality" that may or may not coincide with a state's jurisdiction (Scottish, Breton, 
Tamil) (Connor 1981:201). The importance of languages for both ethnicity 
and nationalism is now widely recognized (cf Conversi 1990, Balibar 1990: 
350, Eriksen 1990). 

The bases of politically significant collective solidarities thus include kinship 
group (family, clan), region, class, language, religion (or sect), tribe, 
ethny, and state (country). Within national, centralized states, cleavages based 
on substate affinities constitute an explosive component, world-wide (cf Esman 
& Rabinovich 1988). Since few territorial (national) states are populated by a 
single ethny, the term “nation-state” is muddled and intellectually dangerous 
(Connor 1981). In a long line of devastating critiques, Ra'anan's 1990 article 
shows why this semantic monstrosity should no longer be casually used (in 
Montville 1990:5-20). 

Since Eugen Weber's crucial study of French nationalism (1979), it has 
become increasingly clear that nationalistic unity of modern states is a very 
late and partial historical development (Connor 1990). For example, "France" 
was not a single linguistic or cultural unit among the masses until after World 
War I, at earliest (Weber 1979, cf Watkins 1991). As Connor (1990:92) says, 
nationalism as a mass phenomenon cannot be directly inferred from elite 
claims. 


Processes 


These, then, are our central structural units: ethnies, nations (nationalities), 
classes, and states. What are the processes within and among these units to 
which our review must be addressed? 

First, there is the notion of competition, usefully defined as "striving for 
scarce objects under rules which limit the damage competitors can inflict on 
each other" (Thompson & Quets 1990:249, cf Williams 1947:43). (Rivalry is 
competition among specific, identified actors.) It is crucially important to 
distinguish between peaceful individual competition and regularized collective 
opposition—i.e. political, economic, and cultural contention—and overt con- 
flict. It is correct that almost always ethnic conflict has been based in part 
upon fear of competition (or rivalry). But an essential difference exists between 
rule-constrained individualized striving for scarce values and collective actions 
aimed at displacing, neutralizing, injuring, or destroying opponents. 

Conflict, of course, has been given numerous definitions: psychological, 
cultural, and social. Social conflict was defined some decades ago as “a 
struggle over values (distributive or nondistributive) in which the immediate 
aims of the opponents are to neutralize, injure, or eliminate their rivals" 
(Williams 1947: 43). 

This characterization points to a general area of interest. For particular 
research purposes, however, it is necessary to identify a complex range of 
more specific phenomena. The main forms of collective ethnic conflict are (i) 
turmoil (strikes, demonstrations, mutinies (Rose 1982), protests, sabotage, 
communal rioting, and terrorism); (ii) internal war (coups d'etat, secessionistic 
rebellions, civil wars and revolutions); (iii) genocide. Each of these involves 
further distinctions, e.g. there are many complexities in defining terrorism 
(Gibbs 1989), rebellion (Boswell & Dixon 1990:540), or even coups d'etat 
(Collihan & Danopoulos 1993:435, DL Horowitz 1985: Ch. 12, cf Hintzen 
1989). 

Such distinctions among types of conflict involve (i) the extent of mass 
mobilization, (ii) the extent of centralized organization and control, and (iii) 
the amount and types of violence. “Turmoil” generally is low on all three 
criteria; revolutions maximize all (see Zimmerman 1980:168—71). 


Recent Developments in Research and Theories 


The Preface to Volume 18 (1992) of the Annual Review of Sociology has said, 
“The essays in this volume reflect the increasingly comparative nature of 
sociology, whether international or historical" (p. v). Our own survey indicates 
that in the field of ethnic studies, comparative sociology still has room to grow. 
Until very recently the sociological literature in the United States was very 
heavily weighted toward studies of domestic populations, especially blacks 
(Lavender & Forsyth 1976). A similar survey of political science journals 
found (i) very little attention to any ethnic topics, and (ii) major emphasis on 
blacks (70% of articles dealing with ethnic groups) (Flood 1980). 

Correspondingly, "theories" of ethnic conflict have been primarily focused 
upon intrasocietal processes: split labor markets (Bonacich 1976), economic 
competition (Banton 1983, Olzak 1983, 1992, Olzak & Nagel 1986), internal 
colonialism (Hechter 1975), assimilation (Gordon 1964, Hirschman 1983, 
1991). Yet such domestic processes manifestly are strongly affected by eco- 
nomic and political rivalries among national states and by many other trans- 
national and international influences. Recognition of such influences since the 
1970s has led to greater attention to the part played by state power and 
resources and international conflict and cooperation (Gurr 1980, Greenwood 
1985, Brass 1985, DL Horowitz 1985, Gurr 1993). 

A rich set of descriptive studies of varied ethnic situations around the world 
has provided a sort of inventory of conditions affecting interethnic relations 
(cf Horowitz 1985, Montville 1990, Portes & Rumbaut 1990). This inventory 
includes: type of state and regime, political resources of ethnies, number and 
size of ethnies, relations of ethny and class, regional inequalities, relative 
deprivation, demographic "balance" and changes therein, cultural content of 
ethnic identities, extent and kind of “modernization,” political party structures, 
sequences of interactions among ethnies and between them and the state. For 
example, dependency, world system and global hegemony theories of domestic 
conflict have paid special attention to weak authoritarian or elitist states in the 
world-system periphery; the primary mechanisms posited are class polarization 
and capitalistic intrusions upon "traditional" societies (Jenkins & Schock 1992: 
176-79). 

A tendency among theorists in this field is to focus upon a single factor or 
set of conditions and to ignore, downplay, or reject others. So, it is said, the 
primary factor in ethnic conflict is resource mobilization, or competition, or 
state actions, or internal colonialism, or economic inequalities, or labor market 
segmentation, or ethnic inequalities, or social strains and anomie—and so on. 
Fashions abound—at one time socially induced motivations such as relative 
deprivation are in favor—only to be rejected out of hand at a later time. A 
major exception to the unidimensional theorizing is the work of Pinard and 
associates (see Pinard & Hamilton 1984, 1986), who propose a multifactorial 
scheme of six components: deprivations, aspirations, moral obligations, col- 
lective incentives, selective incentives, and expectations of success (Pinard & 
Hamilton 1986). 

For several decades the academic industry created by allegedly antithetical 
models has flourished: sociobiological perspectives (van den Berghe 1981, 
1990, Shaw & Wong 1989, van der Dennen & Falger 1990), primordial 
solidarities (Geertz 1973, Stack 1986), competition (Olzak 1983); group mo- 
bilization (Klandermans 1984, Tarrow 1988); internal colonialism (Hechter 
1975, 1987); cultural content (Prager 1982, B Anderson 1983, Smith 1988), 
rational actors (Muller & Weede 1990). 

Efforts to reformulate and integrate the diverse approaches have been few 
but not wholly lacking, e.g. Williams (1978), McKay (1982), Scott (1990), 
and Gurr (1993). 

Research methods used in studies reviewed here cover a very wide range, 
indeed. Case studies predominate in the descriptive work, although increas- 
ingly there are explicit comparisons (cf again, DL Horowitz 1985, Wickham- 
Crowley 1992). Ragin (1991) has given a thorough introduction to the prob- 
lems of emphasis on variables. On the whole, comparative social science in 
the past has been preoccupied with case studies, but recent cross-societal 
studies have shifted the focus to multivariate statistical analysis of many cases. 
New methodologies—e.g. event history analysis and qualitative comparative 
analysis—hold some promise of combining analytic generalizations with con- 
textual specification. 

Recent years have brought a marked convergence between studies of ethnic- 
ity and studies of nationalism (Smith 1992). Increasingly also we are seeing 
cross-state statistical analyses of large data sets. By 1980, Gurr could say that 
modern research on conflict is characterized by the use of systematic methods, 
comparing phenomena across settings or across time, and by widespread reli- 
ance on quantification and statistical techniques (1980:4). 

Although the great majority of ethnic studies rely on cross-sectional data, 
some now analyze sequences of conditions or events (Hannan 1979, Olzak 
1983, Strang 1990) either in narrative accounts or by event-history techniques. 
Case studies actually reinforce the advocacy of sequential approaches. Thus, 
in a review of nine major works on Ethiopia, Colburn (1991) has urged that 
an extended sequence of "misgovernment" must be given an important place 
in analysis of revolution and violent ethnic separatism. Not just prior structural 
conditions but active choice, he contends, must be part of an adequate under- 
standing (Colburn 1991:585—86). 

The comments by Jenkins & Kposowa (1990:862) concerning hypotheses 
and theories that attempt to explain military coups apply to ethnic conflicts 
more generally: “They have been treated as single propositions evaluated by 
case analysis or conventional regression techniques. We believe these are 
complex theories with multiple causal links that should be evaluated with 
complex causal models.” 


THE CHARACTER OF ETHNICITY: CONTROVERSIES 
AND CONVERGENCES 


The repetitive arguments between advocates of primordial and instrumental or 
situational conceptions of ethnicity can and should be superseded. Ethnies are 
both primordial and circumstantial—in different ways under different condi- 
tions. Some ethnies persist for many centuries (Smith 1992); some are newly 
emergent; some have fluid boundaries, others are rigid entities. It is important 
only to adequately specify the character of whatever units we make the object 
of analysis. Modern nationalisms are analogous to older forms of ethnic elec- 
tion (cf Armstrong 1982, van den Berghe 1981). From time to time, studies 
of ethnicity rediscover the distinction between category and collectivity—eth- 
nic populations or categories versus ethnic communities or groupings (ethnies). 
Categories turn into ethnies through the development of membership and 
identity arising in social interactions. Ethnic networks develop and ethnic elites 
and leaders can then seek to advance or defend collective interests through 
collective action (Martiniello 1993:238—43). 

Both van den Berghe (1981) and DL Horowitz (1985:52—70) indicate that 
ethnies involve a belief in common descent (affinity through ascription), some 
notion of distinctiveness, and a membership transcending face-to-face interactions. 
It is central to ethnicity that there is always an important emphasis upon 
descent; it is this ascriptive quality, above all, that renders ethnic cleavages so 
often intractably conflictful. 

In the long term, there are both major changes in ethnic classifications and 
considerable mobility across ethnic boundaries. Thus, ethnicity may be seen 
historically as arbitrary, variable, and ambiguous. But in the short term, ethnic 
boundaries often are strongly correlated with social, economic, and political 
status, and such inequalities are often reinforced by categorical discrimination 
and public policies. Hence, ethnic divisions can become the focus of strong 
passions and of collective conflicts (Hirschman 1986). The "construction" (or 
“emergence”) of new ethnies out of previously diverse cultural groupings or 
the subdividing of previously unitary collectivities is described in numerous 
studies: of Trinidad (Brereton 1979), “Hispanics” and “white ethnics” in the United 
States (Alba 1990, Waters 1990, Jiobu 1990), US Indian tribes (Barsh & 
Henderson 1980), Cote d'Ivoire (Vogel 1991), Punjabis in California (Leonard 
1992), Sierra Leone (Cohen 1981), India (Schermerhorn 1978)—to name only 
a few. Works that emphasize ethnic change or fluidity include Eller & Cough- 
lan (1993), Nagel & Snipp (1993), Heisler (1990), Fugita & O’Brien (1991), 
B Anderson (1991) [1983]. More complex or integrative views are represented 
by van den Berghe (1981), DL Horowitz (1985), Greenwood (1985), Scott 
(1990). It is clear that present-day ethnic populations have been constructed 
over time by conquest, religious movements, migration, biological blending, 
acculturation, and absorption of ethnic-linguistic groups. Thus, ethnic bound- 
aries shift historically, and many individuals have multiple ethnic identities. 
But, as Smith has stressed, ethnicity can be, and is, historically 
perennial without being a fixed essence—rather ethnic continuity resides in a 
varying but basic boundary perception (cf Smith 1989, 1992). Other studies 
that point to the persistence of ethnies and to their resistance against forces of 
change include Spicer (1971), Smith (1981, 1989, 1992), Douglas (1988), 
Kimmel (1989), and Petrissans (1991). 

Assertions that many modern ethnies are artificially created, fabricated, 
instrumentally formed, or imagined are mostly irrelevant for the conclusions 
to be drawn from ethnic revivals and upsurges. The intense passions manifest 
in modern cases of ethnogenesis make pointless the constructionist skepticism: 
the solidarities clearly do exist and are accompanied by strong sentiments and 
collective actions (cf Fishman 1981:239). It is striking testimony that a work 
entitled Creating Ethnicity reaches this conclusion: 

Ethnic groups and their cultures are not merely a completely arbitrary construct: 
there is always a minimum of incontestable and noninterpretable facts necessary 
to win something from the opponent. ... The reality is very elastic but not totally 
arbitrary. (Roosens 1989:156) 


Of course, there are social identities and formations below ethnies, e.g., 
families, lineages, villages, clans, and there are identities above the ethnies, 
e.g. religious organizations, regions, states, interstate organizations, transna- 
tional organizations and social movements. Ethnicity is not everything, but its 
place in social conflict can now be well defined. 


THE STAKES IN ETHNIC OPPOSITIONS 


Grievances and Objects of Contention 


The notion of grievance is often conflated with related concepts of dissatis- 
faction or deprivation. But to say "grievance" is to say that someone is 
aggrieved, not merely deprived, disappointed, frustrated, or dissatisfied. A 
grievance is "a wrong considered as grounds for complaint" (Random House 
Dictionary of the English Language)—that is, a claim is made that an injustice 
has occurred. Just as a claim is a socially legitimated demand, not just a desire 
or want, so a grievance is a normative protest, not just a dissatisfaction. The 
conceptual distinction has deep social roots. The special intensity so evident
in many ethnic conflicts frequently arises from a sense of victimization
(Zaslavsky 1992) rooted in loss of autonomy, loss of historically claimed
territory, infringement of prior rights, or, more generally, treatment thought to be
unfairly discriminatory (Harris 1977). These considerations suggest a specific
hypothesis: The likelihood of conflict is higher when disagreement and opposition
concern collective goods, e.g. language rights, religious beliefs and symbols,
civil and political rights and privileges, regional-ethnic power, or regional-eth- 
nic parity in the economy. The more nearly indivisible the goods and the less 
the access of the “disadvantaged,” the greater is the resentment and the more 
likely is ethnic mobilization, followed by overt conflict (see Fraser et al 1990). 

Conflict is favored when there is interethnic rivalry and at the same time 
the ethnies have low positive interdependence (complementarity), are unequal 
in socioeconomic status, and when the subordinated ethny perceives the su- 
perordinate collectivity as violating norms of fairness (Belanger & Pinard 
1991). Anything that greatly increases ethnic salience is likely to produce 
increased attention to collective goods and—given ethnic inequality—to in- 
creased apprehension of discrimination. What then generates perceptions of 
unfairness is competition/rivalry when an ethny is subordinated or disadvan- 
taged in economic opportunity, social status, political voice and rights, or 
cultural expression. These conditions stimulate grievances that acquire a tinc- 
ture of moral outrage (cf Belanger & Pinard 1991:449). 


Control of Territory 


Some of the intractable ethnic conflicts of recent years have involved disputed 
claims to land: “territory” has not lost its significance as an object of contention 
(cf Kamen 1991). Many ethnoregional conflicts are directed against the state 
in efforts to gain or restore control over a homeland: as in Quebec, Scotland, 
southern Sudan, Eritrea, Tamil Eelam (Sri Lanka), Kosovo, Biafra, Sikh Khali- 
stan (India) (Esman 1985:438). Other conflicts are directed toward rival ethnies 
(often in-migrants) that take land or are thought to threaten to reduce an 
indigenous ethny to a demographic minority in danger of losing control of the 
state and the economy, e.g. Malays, Fijians, Assamese. In such situations a
central fear is that of being overwhelmed and dispossessed (Esman 1985,
Horowitz 1985:166-81). A frequent pattern is for violent conflicts to erupt in
societies in which several ethnies claim priority of indigenous status, as in 
Lebanon and Yugoslavia. 


Other Stakes 

The stakes in ethnic rivalries include participation in political decisions,
cultural status, and economic opportunities (Esman 1990:58-59). The great difficulties
of resolving or managing many of the world’s ethnic confrontations reside in
the ascriptive character of ethnicity and its high symbolic content (e.g. the
indivisible good of collective prestige) (DL Horowitz 1990:115-16). The
evidence is convincing that social respect and prestige are real and important
foci of ethnic rivalries and of violent conflicts. 

The overwhelming impression created by the research literature and the 
events to which it refers is the extreme fragility of economic interdependence 
in confrontations involving polarized ethnies, as in Northern Ireland (See 
1986), Cyprus, Yugoslavia, the Soviet Union, Rwanda (Newbury 1988), Sri 
Lanka (Tambiah 1986). 


ETHNICITY AND NATIONAL STATES 


The Context 


Ethnic relations affect and are influenced by relations among states. This basic 
and important truth has been somewhat obscured by the concentration of much 
scholarship in history and political science upon states and by the parochial 
fixation of much sociological research upon domestic ethnic relations. A step 
forward is Esman's (1990) description of six categories of transnational or 
interstate influences involving ethnies. Ethnic ties do not stop at state bound- 
aries: nearly two thirds of the 227 communal groups included in Gurr’s analysis 
have kindred in another country (1993:175). There are numerous international 
networks of communication, travel, political support, and material assistance; 
these networks are especially prominent among Muslims and indigenous peo- 
ple. Some 42% of major ethnies are split among two or more states, as witness 
the familiar cases of Kurds, Basques, Tamils, Hungarians, Bakongo, Balochi 
(Nielsson 1985). Among some of the upwards of 100 million persons who live 
outside their countries of citizenship (Teitelbaum 1992:208), there is a dis- 
placed localism that can lead to unrestrained nationalism of a militant kind (B 
Anderson 1991 [1983]). And communal ethnic conflict is itself often conflict 
about the nature of the state and its authority—as in the classic cases of 
Belgium and Quebec (Covell 1985:233, Breton 1988). 

More generally there is evident a world-wide tension between the principle 
of status based on individual achievement and that of status based on ascribed 
collectivity-membership. Striking examples are found in India (Beteille 1991) 
and in Malaysia (Horowitz 1985, Hirschman 1986, Marty & Appleby 1993a,b). 

Evidently sociological analysis needs to consider ideological factors. "Na- 
tional self-determination" or "self-determination of peoples" is a variant of 
doctrines of popular sovereignty. If sovereignty of a state rests upon the will 
of the people it claims to encompass, then the legitimacy of states can be 
challenged by the claims of subnationalities. If at the same time it is claimed 
that sovereignty is absolute, so that states treat as illegitimate all external 
humanitarian efforts or human rights interventions, then ethnic issues can
become crucial not only for regime and government stability but also for state 
autonomy and international relations. That "foreign" rule is illegitimate is 
indeed an explosive idea in a world of multiethnic national states (cf Connor 
1990, 1992). 

In the crucial test-case of genocide, the conspicuous failures of the United 
Nations to intervene have been analyzed in detail by Kuper (1985). Not only 
do heads of states fear intervention elsewhere as a precedent that may later be 
used against them, but reluctance may be heightened also by the circumstance 
that political mass killing has been a means to power on the part of many 
governments represented in the United Nations (Kuper 1985:128). 


SOURCES OF OPPOSITION AND CONFLICT: 
HYPOTHESES AND FINDINGS 


Introduction 


Given, then, that there are ethnies and other relevant social formations, what
are the sources of the many different kinds of oppositions, protests, and con- 
flict? First of all, the literature tells us that the objects of ethnic contention 
include such valuables as use of residence areas, control of territory, credit, 
jobs, markets, capital, armaments—and collective goods such as language use, 
access to and control of education, civil rights, practice of religion, cultural 
expression, collective respect and prestige, and political rights. Any or all may 
become the focus of confrontations between ethnies and states or among 
ethnies themselves. Second, many studies focus upon particular types of con- 
flicts—attempting explanations of communal violence, riots, coups d'etat, 
secessionistic rebellions, or revolutions. Third, many other studies reverse the 
perspective to single out causal variables (e.g. demographic changes, class
inequalities) or particular explanatory models alleged to hold across diverse
settings and conflicts: rational choice, economic competition, cultural content, 
political mobilization, modernization, and so on. 

To begin with, we examine some general processes of opposition and con- 
flict. There are marked differences in the conditions associated with different 
types of ethnic rivalry and discord. For example, Beissinger (1992) analyzes 
data on 2347 nonviolent protest demonstrations and 379 violent mass actions 
in the USSR, 1965-1989, to show sharp differences in prior conditions and
accompanying contexts. Violent protests varied independently of political op- 
portunity structures; nonviolent protests were less frequent when opportunities 
were either open or closed. Violent protests were characteristic of rural or 
small town settings (in contrast to US experience); nonviolent protests were 
more typically urban. The latter protests represented relatively high levels of 
social mobilization among relatively well-educated urban populations with
substantial organizational resources (cf Taagepera 1992, on Estonia). Violent 
outbreaks come from populations with fewer independent resources for col- 
lective action but with a strong sense of victimization. (For several major 
ethnies, violent events actually preceded nonviolent demonstrations.) A prior 
history of nonviolent protest helps to predict nonviolent demonstrations at a 
later time, but no significant relation was found for violent events. Not a history 
of ancient animosities, but rather, current conditions best explain these out- 
breaks of collective violence. 


Processes of Violent Conflict 


It has long been recognized that violent conflicts, once underway, develop 
their own internal dynamics. Conflict is a vortex into which are drawn new 
elements and within which there emerge numerous transformations (Williams 
1965:25). The self-sustaining and transformative nature of violent ethnic con- 
flicts has been repeatedly noted, e.g. Azar & Burton (1986), Azar (1990), 
Panda (1993). These processes are chaos-like, full of probabilistic and
near-random events not predicted by the actors. A little-understood phenomenon
is the phase-change in which collective violence produces otherwise unthink- 
able behavior. 

Violence is not limited to challenging collectivities that are seeking inclusion 
in, control of, or exit from a polity. It may be evoked or initiated when a 
collectivity already nominally legitimized as an insider regards its position as 
basically threatened. Although initiation of violence by a challenger often 
carries high risks—increased stakes, possible disunity, harsh repression—nev- 
ertheless violence may emerge during a process of conflict as a viable “last 
resort” (Grant & Wallace 1991). 

Grievances must be taken into account; they are not constructed out of the 
air: objectively ascertainable conditions do matter. Gurr’s 1993 analysis (e.g. 
p. 188) finds that grievances are created by poverty and political and economic 
differentials among communal groups, and that restricted political access and 
a history of lost autonomy are important conditions for separatist demands and 
rebellion. 

Not only diffuse ethnic oppositions and tensions but also separatist move- 
ments draw strongly upon fears of being ‘swamped’ or permanently subordi- 
nated by a dominant ethny or ethnic coalition (Horowitz 1985, Mayall & 
Simpson 1992:15, Pfaffenberger 1990). “Backward” groupings frequently fear 
cultural extinction, but even advantaged ethnies can be aroused to see dire 
threats of subordination. 

The fact that ethnic solidarity may have been roughly constant while overt 
ethnic conflict has fluctuated does not mean that the solidarity is not important 
in explaining conflict—only that other conditions, of threat and opportunity, 
are necessary to evoke the more militant/violent confrontations. Indeed,
solidarity in the sense of mutual recognition of collectivity membership is a
necessary basis for most forms of collective conflict (Darby 1990). 

Because collective violence typically occurs along pre-existing lines of 
nonviolent oppositions (Rule 1988:266), to account for collective conflict one 
needs to account for collective solidarity. Here several major theoretic issues 
are joined. For example, Hechter has rejected what he calls "normative" and 
“structural” accounts of group solidarity in favor of an individualistic “rational 
choice" model (Hechter 1987). In that model solidarity is created when "the 
group" [collectivity] produces exclusive collective goods for its members, who 
depend upon the collectivity for diverse, multiple goods, and when the collec- 
tivity exerts sanction ("control") against nonconforming members. 

But numerous cases show that structural processes can create and solidify 
ethnic identities. Periodic violent conflicts intensify ascribed membership, and 
the greater the violence the more intense become the pressures for solidarity, 
as noted by Darby (1990:157) for Northern Ireland. In general, “civil violence 
is more likely the greater the solidarity among the participants" (Rule 1988: 
266). And in a cross-national study, Boswell & Dixon (1990:556) note that 
the data show that ethnic conflict, measured as “separatism”, is the variable 
most highly correlated with "rebellion." 

Studies that closely examine the interactive processes of mobilization and 
conflict are rare. McAdam has described the black insurgency of the 1955- 
1970 period in terms of a process of tactical interaction between civil rights 
advocates and their opponents—a continuous process in which tactical inno- 
vations are met by counter-measures (McAdam 1983). Morris emphasized the 
pre-existing bases and internal structure of the civil rights movements as 
important determinants of both initial mobilization and the innovations and 
spread of strategies and tactics (Morris 1981). 

The concept of a repertoire of collective action, as developed by Tilly (1986), 
proposes that particular forms of protest and conflict are historically contin- 
gent—they are used and they "work" only in particular settings. Voluntary 
associations using mass demonstrations to prod political change arose in so- 
cieties with electoral systems. Systematic nonviolent mass protests rest upon 
complex sets of understanding, skills and tactics, typically developed in mod- 
ern societies with urban elites experienced in electoral politics (cf Beissinger 
1992, Zaslavsky 1992, and the prescient work of Amalrik 1969 and d'Encausse 
1978 for the Soviet Union). 

"Political process" theories are said to argue that "collective action stems 
from political struggles over the entry of new groups and is determined by the 
mobilization of contenders and the structure of political opportunities" (Jenkins 
& Schock 1992:170). Groups excluded from effective voice in the polity may 
become unruly—but under what conditions? A major clue is that the intensity 
of rebellions is higher when potential separatism (primarily ethnic) is great
and so is political and economic discrimination (Jenkins & Schock 1992:171, 
citing Gurr 1968, Gurr & Duvall 1973, Gurr & Lichbach 1979, Boswell & 
Dixon 1990). 


Class, Economy and Competition 


Recurrent debates as to the relative importance of class position and interests 
as over against ethnic identities and commitments have not produced—and 
cannot be expected to produce—a decisive verdict. To dismiss ethnicity as 
false consciousness ignores the clear evidence that ethnies often sacrifice 
economic interests in favor of symbolic gains, e.g. Sinhalese and Tamils in 
Sri Lanka (Pfaffenberger 1990:249). The relevant question is how the two 
aspects of social reality affect actions in specified contexts (Goyden 1983, 
Norton 1984). 

An instructive example is the collective violence in India (more than 7000 
deaths in five years, 1978-1983) between Assamese Hindus and Bengali 
Muslims. Bengali immigration clearly led to competition for land, jobs, and 
other economic and political resources. But cultural issues (especially of
language and religion) became salient in political processes, transforming a
secular movement into one of ethnic and religious oppositions. The state was not
a neutral arena but rather both a resource and a major influence. Conventional
monofactorial theories fail to account for the violent and protracted conflicts 
(Darnell & Parikh 1988). Appropriate interpretation involves class, demo- 
graphic change, ethnic identities, religious affiliations, state actions, and po- 
litical processes. 

Analyses of labor unrest and ethnic conflict in the United States (Olzak 
1989) have led to the following conclusions: (i) Periods of economic contrac- 
tion show high levels of ethnic conflict. (ii) Rates of ethnic conflict decline 
sharply with time elapsed since the last event of this kind. (iii) Waves of strikes 
and antiblack events are strongly correlated. (iv) But the effect of strikes
disappears when the strength of union organizations is taken into account. (v)
Yet growth of unions coincides with rates of attacks on blacks, but is negatively
associated with rates of attacks on nonblack groups. (vi) Immigration
intensified labor market competition among foreigners, native whites, and 
blacks, but two thirds of all collective attacks were against blacks. 

Thus, the targets of ethnic conflict were not predominately the white im- 
migrants (who provided much of the direct job competition), but rather the 
black Americans—who were already objects of systematic discrimination. 

Ethnic rivalry, whether or not extending into violent conflict, takes different 
forms in ranked, in contrast to unranked, systems. When ethnies are ranked, 
as in caste systems, class and ethnic membership coincide, hierarchies are 
locally based, and direct interactions of subordinates and dominants are frequent.
So long as dominance is stable, struggle takes such forms as malingering,
sabotage, evasion, work stoppages, and sporadic violence. If collective vio- 
lence eventually occurs, it is likely to mean a social revolution, e.g. Rwanda 
in 1959 (Horowitz 1985:30). In contrast, unranked ethnies within a single state 
tend toward a politics of inclusion/exclusion—seeking power-sharing or con- 
trol, and minimizing intra-ethnic class divisions. If ethnic conflict then be- 
comes overt and polarized, the hostile ethnies seek autonomy and either 
dominance or separation. 

Ethnicity, clearly, often coincides with class. Equally clearly, neither class 
nor ethny is reducible to the other (cf Rothschild 1981). Increased scholarly 
attention to ethnic conflict does not lessen the importance of class and eco- 
nomic sectorial bases for other forms of collective rebellion, e.g. peasant-based 
revolts and revolutions (cf Paige 1975, Jenkins 1982). Yet an emphasis on 
class struggle can lead to a neglect or downplaying of ethnic cleavages as a 
factor in peasant uprisings. Detailed studies of particular cases, not surpris- 
ingly, show complex interactions of class, state, ethnicity, and local social 
structures (e.g. on southern Peru: Orlove 1990). Evidently, local and provincial 
(state) structures may create legal and other constraints that shape the nature 
and outcomes of labor market competition. Thus, market competition as a 
determinant of ethnic conflict rests in part upon prior political controls (cf 
James 1988). 

The accumulated historical studies repeatedly tell us that large-scale collec- 
tive conflicts are enormously complex mass phenomena. Thus Markoff notes 
that in rural France in 1789 there was not a unified peasant rising; rather, there 
were hundreds of varied local insurrections—"an amalgam of quite diverse 
conflicts" (Markoff 1985:761). 

Under conditions of extreme scarcity, informal economies develop—smug- 
gling, black markets, hoarding, diversion of resources from the public arena, 
official collusion. Such parallel systems tend to cohere around ethnic and 
regional social networks. One result is a sharpening of ethnic boundaries— 
within a class system shaped by the state (Chazan 1986:142-43). 

The respective parts played by class and ethnic factors in political opposi- 
tions can change through time. In the Quebec independence movement, early 
class differences were marked: lowest support among managers and propri- 
etors, farmers and workers; higher among the highly educated and intellectual 
strata. Over time, support increased in all classes, but the intelligentsia con- 
tinued to lead. These findings are consistent with evidence for the high im- 
portance of the intelligentsia in nationalistic movements in countries as dif- 
ferent as Nigeria, France (Brittany), Spain (Basques), and Trinidad (Pinard & 
Hamilton 1984). 

Ethnic activists include a high proportion of intellectuals—school teachers, 
mass media people, writers, publicists. They recently have been upwardly 
mobile. They are articulate and ambitious and at the same time often occupied
with a search for identity (cf Beer 1980). 

In the well-studied case of Belgium, ethnic opposition was preceded by 
struggles that were first defined in religious terms and then as class conflicts. 
In each period, the political solutions were constitutional, but the resulting 
arrangements were subsequently objects of ethnic confrontation. Analysis by 
Covell (1985: see Brass, ed.) suggested once more that it is when class and 
ideological or religious differences coincide with ethnic cleavages that the 
most severe conflicts will appear (Covell 1985:256-57).

In a skeptical review of research on political protest, Zimmerman (1980:77) 
was willing to say that if there is one basic finding, it is that economically 
developed societies have less intense forms of political violence. It happens, 
of course, that the economically developed societies differ from others in many 
other potentially decisive ways, especially in the structure of political oppor- 
tunities. 

Ethnoregional studies offer puzzles. Recall that Beissinger (1992) presented 
data for the Soviet Union showing greatest violence in poorer and less urban- 
ized regions. But Beer's major work on ethnic activism in France (1980) 
reported that extra-electoral dissent was higher in the economically less de- 
veloped areas, whereas ethnic electoral political activity is more typical of 
"developed" regions. Horowitz (1985: chapter 6) developed the distinction 
between “backwardness” (disadvantage) of ethnies and of the region of con- 
centrated residence. His inventory of diverse cases shows that secessionistic 
movements tend to be frequent and early for backward groups in backward 
regions, somewhat frequent but late for advanced groups in backward regions, 
rare but early for backward groups in advanced regions, and, finally, rare and 
late for advanced collectivities in advantaged regions (Horowitz 1985:258). 
Special threats and opportunities help to explain such cases as Basques and 
Catalans in Spain and Slovenes and Croats in Yugoslavia (cf Aslund 1992:85). 
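
The fourfold pattern attributed above to Horowitz (1985:258) can be laid out as a small lookup table. The following Python sketch is only an illustration of how the typology is organized, not part of Horowitz's analysis. The names `Expectation`, `SECESSION_PATTERNS`, and `expected_pattern` are hypothetical, and the sketch assumes that "advantaged regions" in the final cell means the same thing as "advanced regions."

```python
from typing import NamedTuple


class Expectation(NamedTuple):
    """Expected character of secessionist movements for one cell of the typology."""
    frequency: str
    timing: str


# Keys are (group position, region position), following the summary of
# Horowitz (1985:258) in the text. The labels are illustrative assumptions.
SECESSION_PATTERNS = {
    ("backward", "backward"): Expectation("frequent", "early"),
    ("advanced", "backward"): Expectation("somewhat frequent", "late"),
    ("backward", "advanced"): Expectation("rare", "early"),
    ("advanced", "advanced"): Expectation("rare", "late"),
}


def expected_pattern(group: str, region: str) -> Expectation:
    """Return the expected frequency and timing for a group/region pairing."""
    return SECESSION_PATTERNS[(group, region)]


if __name__ == "__main__":
    for (group, region), exp in SECESSION_PATTERNS.items():
        print(f"{group} group in {region} region: {exp.frequency}, {exp.timing}")
```

Read this way, the table suggests that the position of the group tracks timing (backward groups move early, advanced groups late), while the position of the region tracks frequency.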

If ethnies are geographically concentrated in different regions of a sizeable 
country, the unevenness of environment is enough to create inequalities and 
thus potentials for conflict. Since ethnies are ascriptive, "ethnic homelands" 
favor boundary persistence and inequality (Lambert 1981, Ayoade 1986:110). 
Such ethnicity becomes especially salient when there is a perceived external 
threat. 


Demographic Factors 


Demographic shifts can powerfully affect ethnic relations. Between 1979 and 
1989, the following rates of population growth prevailed in these contrasting 
regions of the Soviet Union: Estonians, 0.7%; Latvians, 1.4%; Tajiks, 45.5%;
Uzbeks, 34.0%. Central state redistribution of resources under these conditions
came to be seen as subsidizing rapidly growing groups, creating potential 
migrants into the richer, slow-growing regions, and generating a long-term
threat to cultural survival (Zaslavsky 1992:115). 
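
To make the contrast concrete, the growth figures quoted above can be converted to approximate annual rates. The Python sketch below assumes that the figures are cumulative percentage growth over the ten years between the 1979 and 1989 counts (0.7, 1.4, 45.5, and 34.0 percent); the text does not say so explicitly. The names `annualized_rate`, `DECADE_GROWTH`, and `ANNUALIZED` are hypothetical.

```python
def annualized_rate(cumulative_growth: float, years: int = 10) -> float:
    """Convert cumulative growth over `years` into a compound annual rate.

    Both the input and the result are fractions (0.455 means 45.5 percent).
    """
    return (1.0 + cumulative_growth) ** (1.0 / years) - 1.0


# Assumed reading of the figures in the text: cumulative growth, 1979 to 1989.
DECADE_GROWTH = {
    "Estonians": 0.007,
    "Latvians": 0.014,
    "Tajiks": 0.455,
    "Uzbeks": 0.340,
}

ANNUALIZED = {group: annualized_rate(g) for group, g in DECADE_GROWTH.items()}

if __name__ == "__main__":
    for group, rate in ANNUALIZED.items():
        print(f"{group}: {rate * 100:.2f}% per year")
```

On this reading the Tajik population grew at a little under 4 percent a year and the Estonian population at well under one tenth of one percent a year, a gap of more than fifty to one.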

The 1990s have brought many examples in which rapid and large in-migra- 
tion of ethnically distinctive populations arouses fear and hostility. Thus, the 
surges of Bengalis into Assam waked fears of being "swamped" and led to 
demands that Assamese be the language of education and administration. The 
resulting complex developments eventuated in the violence of the 1983 elec- 
tions (WK Anderson 1990). 

The internationalization of labor supply in the long period of economic 
expansion from the end of World War II to the mid-1970s produced tensions 
in country after country between the status of citizenship and the status of 
in-migrant populations that have distinctive cultural or physical characteristics 
(cf Axford 1987:374-75). The large-scale international labor migration of
ethnically identified populations into democratic polities raises political con- 
tention and episodes of violence, as in Germany, France, and Great Britain, 
among others (Esman 1986, Brubaker 1989, Horowitz & Noiriel 1992). 

High rates of population growth in poorer countries ensure pressures for 
international migration; when migration occurs, it results in diaspora and ethnic 
enclaves—which in turn increase the likelihood of ethno-political conflicts in 
receiving states and between exporters and importers of populations (Esman 
1992). There is evidence of a growing consensus in the European community 
in favor of ethnically restrictive immigration policies (Allen & Macey 1990: 
378). 

The movement toward an integrated European economy increasingly raises 
broader questions concerning immigration, labor mobility, ethnicity, and citi- 
zenship. National states still claiming sovereignty face increasing ethnic con- 
flict (Miles 1992). A typical response of national states is to impose restrictions 
on immigration. 

Very large international movements of population are increasingly volatile 
and unpredictable, and both economic inequalities and violent conflicts gen- 
erate great pressures for migration (Teitelbaum 1992). Accordingly, the likely 
prospects are for continuing ethnic tensions in receiving societies. 


State Versus Ethnies 


The accumulated research findings show not only how state structures affect 
and are affected by ethnic formations, but also how political changes react 
upon ethnies. Thus, the politicization of ethnicity has been a prominent feature 
of transitions, since 1970, from authoritarian rule in some 25 countries. Periods 
of transition relax suppression and open new opportunities for ethnic entrepre- 
neurs. Ethnies or ethnoregional groupings with advanced economies and great 
cultural differences vis-à-vis the dominant political groups are most likely to
seek autonomy or national sovereignty (Frye 1992:623, cf Horowitz 1985 on 
"advanced" and "backward" groups). Examples are: Basques and Catalans in
Spain; Estonia, Lithuania, Latvia, Moldavia, Armenia, and Georgia in the 
Soviet Union; Slovenia and Croatia in Yugoslavia. Of 12 early seekers after 
autonomy, 10 were in advanced regions (exceptions: Slovakia and Kosovo). 

In general it appears that successful ethnic challenges to central states are 
more likely to follow a state crisis than to succeed because they produced a 
state crisis. Putative causes of rebellions against the state include income 
inequality, economic decline, state repression, international economic and 
political dependency, wars, invasions and interventions (Boswell & Dixon 
1990). Boswell & Dixon claim that their analysis of 63 countries points to 
three clusters of factors inducing violent rebellions: (i) relative deprivation
(inequality and/or economic decline), (ii) resource mobilization (economic 
development; inverted U for state repression), (iii) dependency (indirect influ- 
ence through inequality, repression, and slow economic growth). 

We note that results of multivariate analyses of relationships between in- 
come inequality and deaths from political violence often show high sensitivity 
to unusual cases as well as to measurement and analysis problems (Weede 
1985, Muller 1985). Also, it is clear that rapid social change does not lead 
directly to political upheaval; the accompaniments are quite variable
(Zimmerman 1980:279-83). Nor do ethnic tensions and conflicts invariably lead to
rebellions or separatist movements. Alternatives include: stable dominance of 
one ethny, together with cooperation of others; multiple power-sharing; and 
communal conflict and civil war. In some cases, however, persisting violent 
ethnic conflict signifies "endemic secessionism" (Mayall & Simpson 1992). 
Major instances are north and south Sudan and Eritrea and Ethiopia. Mayall 
and Simpson identify five major contextual conditions that help to predict 
whether ethnic cleavages will lead to secessionist movements: colonial poli- 
cies, post-colonial efforts by states to erase ethnic diversity, political economy 
of dissidence, religious (confessional) allegiances, and a regional environment 
of support or opposition (1992:9-10). In general, modernization appears to be
much less important as a source of ethnic conflict than previously thought 
(Leifer 1981). 

Numerous studies have examined the proposition that collective protests 
and/or rebellions are greatest at intermediate levels of political opportunities 
and lowest under full access or extreme repression (see Jenkins & Schock 
1992:172-75). Conditions favoring collective action have been variously
identified as elitist regimes, moderate political freedom, moderate repression,
extreme but incomplete repression over an extended period, elite divisions, and
political instability. Several of these circumstances are likely to be found in 
weakly institutionalized ethnocracies in multiethnic societies (cf DL Horowitz 
1985). There is considerable scholarly agreement that the type of political 
system strongly affects the frequency and intensity of protest behavior, i.e.
protests are more intense in elitist and autocratic than democratic states
(Zimmerman 1980:173, Tiryakian & Rogowski 1985, Beissinger 1992, Gurr
1993:184-85). In general, democracies have more extensive but less deadly protests.

Once a society has been through a series of ethnically based political con- 
frontations and violent changes in control of the state, it typically has a 
narrowly based authoritarian regime—an ethnocracy. Such regimes are diffi- 
cult to displace, as witness Hafez Assad in Syria or Saddam Hussein in Iraq. 
As DL Horowitz notes (1985:499): "Only an escalation of the violence to the
dimensions of civil war or invasion—and both require external assistance—is 
likely to dislodge an ethnocratic regime that has gone through a series of coups 
and purges."! 

In any case, regime changes, whether through coups or otherwise, are seldom 
a matter of indifference to other states that are either nearby or linked by 
economic or political ties. Thus in the coup-prone areas of West Africa, 
interstate conflicts typically follow the “radical behavior of emergent regimes" 
(Agyeman-Duah 1990:566). 

The need for explanatory schemes tailored to particular kinds of conflicts 
is well-illustrated in the case of military affairs, to which we now turn. 


Ethnicity, Military Forces, and Coups d'Etat 


Military and paramilitary forces have played central parts in Third World 
countries for several decades in the efforts of authoritarian regimes to maintain 
their power (Janowitz 1977). Like states as total institutions, military forces 
are both resources for, and objects of, ethnic conflict; in state after state 
ethnically linked military groupings intervene or threaten to intervene in ci- 
vilian politics. Seldom is the military ethnically neutral—whether because of 
the legacy of biased recruiting by colonial regimes, a cleavage between the 
military and political parties and the civilian government, or interpenetration 
of civil and military ethnic fractions (see the comprehensive treatment in DL 
Horowitz 1985: chapters 11-12). 

The importance of ethnicity in the recruitment, deployment, and political 
role of military forces has been well-recognized for several decades (cf Enloe 
1980, Kirk-Greene 1980, Janowitz 1977, 1981). Military service, sometimes
thought to be a homogenizing, even an integrative, process (e.g. in nineteenth 
century France) has proven in many cases to be ethnically divisive and a source 
of major conflicts. 

The importance of ethnic divisions as a factor in military rule varies greatly 
among regions of the world, e.g. very great in Africa and South Asia, joint 
with religious cleavages in the Middle East, much less in Latin America (Tilly 
1992:219). Independence in Latin America came primarily through military
revolts; in Africa, through peaceful means. In the latter, colonial forces had 
been ethnically skewed to make them as apolitical as possible, with the
consequence of unrepresentative militaries after independence (Welch 1986: 
321). 

The great difficulties of understanding military coups are highlighted in the 
recent turbulent history of South American countries (Markoff & Baretta 
1986). Of the many plausible but empirically dubious explanations proffered, 
few give any attention to ethnicity. For Africa, however, numerous studies 
have examined militarized ethnicity in detail (e.g. Jackman 1978).

Case studies identify important context and variables and generate evocative 
hypotheses (Barua 1992). To appraise causally important structures and pro- 
cesses, however, it is necessary to have more systematic multivariate compar- 
isons. For military coups, increased understanding has come from recent work. 

Of the 45 independent black states in Africa, from 1960-1982, nearly 90% 
recorded a military coup, attempted coup, or plot (Johnson, Slater & McGowan 
1984:627). There were 56 successful coups, 56 other attempts, and 102 plots. 
Political conflicts have made these sub-Saharan states among the most unstable 
in an unstable world. In the late 1980s, the central executive in some 25 of 
them was a military regime, and military influence was important in most of 
the others. The most complete available analysis of this extraordinary situation 
has reviewed four major explanations: political development (weak states and 
heavy demands), military centrality, ethnic cleavages or antagonisms, and 
economic dependency (Jenkins & Kposowa 1990). Although the results of 
detailed factor and regression analyses were highly complex—and differ for 
plots, attempts, and successful coups—the strongest predictor of military coup 
activity was military centrality, followed by ethnic cleavages and ethnic po- 
litical competition (Jenkins & Kposowa 1990, Kposowa & Jenkins 1993). 
Social mobilization (or "modernization") did not fare well as an explanatory
condition: it did not affect coup activity through rising levels of political 
participation or domestic turmoil, although it was associated with greater 
likelihood of coups, possibly through enhanced communicative and organiza- 
tional resources. Economic dependence was not a strong or consistent factor. 
The net conclusion is that military centrality and ethnic tensions are the primary 
sources of irregular military interventions. Ethnic plurality and competition 
predict intervention whereas ethnic dominance lowers the likelihood of coup 
activity. Ethnic factors have this impact primarily in elite struggles rather than 
in mass turmoil. Cultural cleavages and rivalry between the two largest ethnies 
were primary contexts for military coups. The likelihood of coups was greater 
when there also was a strong military establishment, along with substantial 
political mobilization. 

Of course, coups constitute a limited set of events and are overshadowed in 
consequences by large-scale communal violence, rebellions, and civil wars.
Still, the regularities apparent in such events illuminate the continuing signif- 
icance of ethnies as actors in political arenas. 

In the larger, more deadly, conflicts there is by now abundant case material 
to support the conclusion that unilateral international involvements in conflicts 
between ethnies and states often, perhaps typically, result in lengthening, 
militarizing and escalating the struggle, in strengthening ethnic boundaries, 
and in reducing the likelihood of negotiated settlements (Nagel & Whorton 
1992). Among many cases are Angola, Iraq, Eritrea, Sudan, Lebanon, Sri 
Lanka, Iran, and several countries in Central America. 

Great variations in issues and in the extent of violence are noted in a close 
examination of ethnic conflicts, peace negotiations, and peace accords in 
several salient cases—Canada, Cyprus, India (Punjab), Sri Lanka, and Sudan 
(de Silva & de A Samarasinghe eds. 1993). With the exception of Canada, all 
the conflicts were substantially internationalized, reached high levels of vio- 
lence, and posed formidable difficulties for mediation or durable peace accords. 
Indeed, Richardson & Wang (1993) suggest that violent ethnic conflicts tend 
to be more protracted and more difficult to resolve than wars between states. 
Not surprisingly, also, a civil war seems more likely to develop into an 
interstate war when one party has ethnic links to another country (cf Blaine 
1973). 


DISORDER AND PROMISE IN RESEARCH 


The continuing chaotic state of general theories of collective violence still 
besets analyses. Thus, Rule's (1988) historical review of theories—rational 
choice, irrational collective outbreak, mass society, structural, resource-mobi- 
lization and others—fails to find central convergence, although particular 
models are deemed useful. 

Accounting-for-variance studies are notoriously difficult, and those that 
wager all upon a single variable typically are inconclusive. Thus, Lichbach 
cites “diverse and contradictory" findings from some 40 studies that support 
every conceivable relationship between economic inequality and political con- 
flict. This disconcerting circumstance has been variously attributed to problems 
of research design or statistical modeling and to deficiencies in formal models 
and theories. Eclectic work has failed to resolve the major puzzles. Remedies
presumably must be sought in more rigorous studies that test crucial differences 
in predictions (Lichbach 1989). 

Further, we must acknowledge that satisfying explanations of collective 
conflict cannot be confined to multivariate "accounting" in which a cluster of 
conditions are shown to precede or accompany the outcome of interest. Timing 
and sequence must be analyzed as well—"system-level outcomes depend not 
only on the configuration of actors, motives, and resources but also crucially
on the sequence in which action occurs..." (Sewell 1987:171). 

What we now have in stock can be briefly summarized. First, there is a vast 
set of richly descriptive case studies, which suggest hypotheses, record phe- 
nomena ("what is possible"), and clarify concepts. Second, there are numerous 
models (or conceptual schemes), often of considerable evocative value. Third,
many studies usefully compare a limited set of case studies. Fourth, new and 
promising analyses are appearing that use statistical techniques to analyze large 
data-sets. From all these resources, gains are being made. An example is in 
order. 

Defining a universe of 227 large "communal groups" (those with culturally
distinct and persistent identities that do not have recognized states or
institutionalized political status), Gurr (1993) has produced a massive analysis of
conditions associated with political mobilization, protest, and rebellion against 
the state. The study identifies five overlapping types of communal groups: 
ethnoclasses, ethnonationalists, militant sects, indigenous peoples, and com- 
munal contenders. Although having common characteristics, each of the types 
is distinctive in important ways. The complex findings defy brief summary, 
but contain such surprises as the weak effects on mobilization of such “re- 
sources" as size, concentration, and coherence (Gurr 1993: 179), and the 
significant apparent influence of contagion and diffusion on protest and rebel- 
lion among militant sects and indigenous peoples (pp. 180, 189). A general 
explanatory model is proposed that combines the perspectives of “grievance” 
and "political interest" (mobilization). Both sets of factors are seen to interact 
in situations of protest and rebellion. 

The general scheme proposes that "objective" conditions such as economic 
deprivation, discrimination, and loss of political or cultural autonomy provide 
the basis for grievances, which are then used by entrepreneurs to mobilize 
collective action. The outcomes of mobilization, in turn, depend upon op- 
portunity structures including international influences. Gurr has summarized 
the main findings as follows (1993:161): "Statistical analysis shows that 
cultural identity, inequalities, and historical loss of autonomy all contribute 
to their grievance. Political mobilization, grievances, and the international 
diffusion and contagion of communal conflict jointly explain the extent of 
political action in the 1980s. Democracy, state power, and institutional 
change help determine whether conflict [opposition] takes the form of protest 
or rebellion." (Reprinted with permission of Butterworth-Heinemann Ltd and 
the author.) 

This study along with hundreds of others allows us to infer the most lethal 
configurations for ethnic conflict. The recipe for this Devil's Brew would 
include: multiethnic population, centralized state with redistributive powers, 
substantial military forces, a few large ethnies of nearly equal political
resources, territorial concentration, marked inequality and/or rapid change in
ethnic positions, and ethnic political struggles for collective goods. Mix well, 
and run for shelter. 

Space limits foreclose a discussion of strategies for managing such conflicts. 
Considerable knowledge does exist. 


CONCLUSION 


The difficulties of research in this field are well-known. The phenomena of 
central interest—ethnicity, states, conflict—are of stunning complexity, requir- 
ing both disaggregation and multivariate analysis. The available data typically 
are cross-sectional, whereas repeated observations over time are needed for 
the dynamic analyses that could begin to disentangle causal sequences. Data 
often are crude and of uncertain validity, e.g. many conflicts go unreported 
and accounts may be incomplete and distorted. Information on microprocesses 
is scarce. Sequential-narrative analysis is in an early state of development. As 
we have just seen, a profusion of concepts, models, and dominant-factor
theories hinders a coherent research strategy and impedes the organization of
cumulative knowledge. 

None of these difficulties is insurmountable. Indeed, the work here reviewed 
has produced much new substantive knowledge and conceptual clarification. 

In spite of numerous controversies and confusing empirical findings, the 
same basic conditions for ethnic conflict repeatedly are identified in the works 
here reviewed. Most important are: 


(i) strong ethnic identities/boundaries;
(ii) grievances;
(iii) opportunity structures;
(iv) resource mobilization.


An implication is that, at the very least, any research that seeks valid 
explanations should attend to all these simultaneously. Future studies should 
carefully specify the type of conflict that is in question—whether nonviolent 
protest, riots, pogroms, mutinies, coups d'etat, communal violence,
low-intensity conflict (sabotage, kidnapping, assassinations, bombings, extortion),
or major armed combat. Type of ethny and extent and kind of grievances must
be similarly identified. Linking the configurations of structural conditions to 
processes of mobilization and countermobilization will open new
possibilities for understanding some of the most intractable and tragic conflicts of
our times. 
Abstract


This review pulls together research on home leaving, home returning, parent- 
child coresidence, and the launching process and integrates it with theoretical 
perspectives related to the life course. Material is included from Australia, 
Canada, and Britain as well as the United States. Because the nuclear family 
form specifies that children leave the parental home when they marry and few 
married persons live with their parents, the focus is on relationships between 
unmarried children and their parents. The review concentrates on the young 
adult years while recognizing coresidence experiences over the life course. 
The nature of the link between residence patterns, family relationships, and 
dependence is an empirical question that, for the most part, has not been 
answered. 


GENERAL OVERVIEW 


The likelihood that an adult child will reside with parents declines very sharply 
after age 18: In 1990, 74% of 18-19 year olds lived with parents, compared 
to 40% at ages 20-24 and 16% at ages 25-29 (US Bureau of the Census 1992a).
A substantial minority of the unmarried, however, continues to live at home 
into middle age. Three recent studies using different data sets asked what 
percentage of over-65 parents have coresident adult children; all arrived at 
figures in the range of 13-15% (Aquilino 1990, Ward et al 1992, Speare & 
Avery 1993). Some of these children are young adults who have not left home
yet, others are temporary returnees, but some are 35 or 40 and home to stay. 
Although coresidence declines with child's and parent's age, it remains a 
significant phenomenon across the life course. 

Media reports suggest a widespread perception that this generation is 
uniquely slow in establishing independence from its parents (e.g. Littwin 1986, 
Kuttner 1988). There is little empirical support for such a perception. The 
percentage of all young adults who lived with a parent does indeed show a 
shallow U-shaped pattern between 1940 and 1990, but the very modest in- 
creases in overall coresidence since 1960 are far overshadowed by increases 
in the population realistically at risk of coresidence—the unmarried. If we 
restrict our denominator to never-married adults, coresidence rates have de- 
clined steadily since 1940. Among unmarried men and women aged 18-24, 
for example, coresidence declined from 80-85% in 1940 to 56% in 1990. At
ages 25-29, the declines were from approximately 80% to 40% over the same
time period (US Bureau of the Census 1943, 1992a). What has happened is a
return to the late marriage patterns characteristic of the first part of the twentieth 
century, accompanied by a substantial decrease in the likelihood that unmarried 
children will live with their parents. 
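
The denominator argument above can be illustrated with a short sketch. It treats the overall coresidence rate as a weighted average of the rates for unmarried and married young adults. The 56% figure for the unmarried comes from the text; the shares of young adults who are unmarried, the earlier rate for the unmarried, and the rate for the married are hypothetical values chosen only to show the mechanism, not census estimates.

```python
def overall_rate(share_unmarried, rate_unmarried, rate_married):
    """Coresidence rate among all young adults, computed as a weighted
    average of the rates for unmarried and married persons."""
    share_married = 1.0 - share_unmarried
    return share_unmarried * rate_unmarried + share_married * rate_married


# Hypothetical inputs (assumptions for illustration, not census figures):
# the unmarried share rises while coresidence among the unmarried falls.
earlier = overall_rate(share_unmarried=0.50, rate_unmarried=0.70, rate_married=0.05)
later = overall_rate(share_unmarried=0.75, rate_unmarried=0.56, rate_married=0.05)

print(round(earlier, 4), round(later, 4))
```

Under these assumed values the overall rate rises even though the rate among the unmarried falls, which is the pattern the text attributes to later marriage.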

If we take a longer perspective, there is even less evidence that this gener- 
ation is slow to establish residential independence. Historical evidence suggests 
substantial diversity in unmarried children's homeleaving behavior in the 
West: although many adolescents from rural and working-class backgrounds 
left home at very early ages to enter an apprenticeship or domestic service, 
others stayed home until relatively late marriage (Wall 1978). Like other 
dimensions of the transition to adulthood, the major historical change in home 
leaving is the extent to which it is now concentrated in a relatively narrow age 
range. In past centuries (and in Latin America today), children left home at 
any age between 10 and 30 (Wall 1978, DeVos 1989); in western Europe and 
North America today, the end of coresidence occurs rather abruptly between 
ages 18 and 21. 

The larger story, and the one on which we have much less information, is 
change in parent-child obligation and its relationship to living arrangements. 
In the past, unmarried children who stayed home were expected to be net 
contributors to the household division of labor and to submit to their parents'
authority (Mayer & Schwartz 1989). Those who lived away from home were 
more likely to send a portion of their wages home than to receive parental 
subsidy for their privacy and independence. In Caldwell's terms (1976), wealth 
and deference flowed up the generational ladder. The changes observed in 
household composition may obscure more dramatic changes in dependency 
and obligation between coresident parents and children. 
THEORETICAL PERSPECTIVES AND CONCEPTUAL
FRAMEWORKS 


Coresidence can be viewed through many lenses, each of which raises some- 
what different questions and points us to different explanations. Here these 
frameworks have been grouped under life course, macro-structural, and ex- 
change perspectives. 


The Life Course Perspective 


The family life cycle perspective assumed a fixed set of developmental stages 
through which families moved from birth to death. In this perspective, the 
empty nest stage is inaugurated by the marriage of the lastborn child rather
than by home leaving, a transition not treated separately. The family life cycle 
model has proven a useful device for organizing demographic data about 
average experiences of historical cohorts, but has been criticized for its failure 
to incorporate the experience of those who divorce or who never marry. As a 
result, many scholars have turned to the life course perspective, a more flexible 
model that emphasizes sociohistorical variability and focuses on transitions 
rather than stages, a wider variety of destinations, more variability in pathways, 
and the possibility of countertransitions (Hohn & Mackensen 1989). It is still, 
however, very much a structural and normative framework. According to 
Hagestad & Neugarten (1985:36), the life course perspective "concentrates 
on age-related transitions that are socially created, socially recognized, 
and shared" (p. 35, emphasis in original). Thus, changes in social norms and 
in the transition to adulthood are considered here as part of the life course
perspective. 


NORMATIVE STRUCTURES Writing of the United States at midcentury, Parsons
(cited in Grigsby 1989) stated that “For young people not to break away from 
their parental families at the proper time is a failure to live up to expectations, 
an unwarranted expression of dependency." We have little evidence, however, 
about the strength of these norms or whether young people or their parents 
feel compelled to honor them. 

As Marini (1984) has noted, age norms are in general very weak: sanctions 
are few and parameters are so wide as to be meaningless. Nevertheless, studies 
of young people and their parents suggest a general expectation of leaving 
home before marriage. Goldscheider & Goldscheider (1987, 1989a) show that
approximately two thirds of high school seniors—pretty much regardless of 
race, ethnicity, class, and religion—and a somewhat smaller majority of parents 
expect children will live independently before marriage. 

One study of college students asked directly about the circumstances under 
which it was acceptable for adult children to live with their parents. Shehan
& Dwyer (1989) report that most students felt 18 and 21 were appropriate ages 
to leave home and that financial difficulties were the chief legitimating factor 
for staying longer than this. Boys were more likely than girls to think parents 
had an obligation to house their children and less likely to think that children 
had an obligation to pay. Shehan & Dwyer (1989) conclude that many children
“feel they have the right to expect such aid" (110). 


TRANSITION TO ADULTHOOD The end of coresidence and the establishment 
of independent living fall within the part of the life course known as the 
"transition to adulthood." Using indicators such as ending schooling, getting 
married, having children, and becoming a fulltime member of the labor force, 
a substantial research literature documents trends in this transition over the 
last century (e.g. Hogan 1981). 

Although studies of home leaving and home returning are often framed as 
part of the transition to adulthood, scholars who focus directly on the transition 
seldom or never use evidence on home leaving as an indicator (Hogan 1981, 
Hogan & Astone 1986, Rindfuss 1990, Modell 1989). The omission is ironic 
given the ready availability of long time series and the readiness with which 
home returning can be viewed as symptomatic of the "disorder in the life 
course" (Rindfuss et al 1987). 

Recent work on the transition to adulthood in Great Britain is an important 
exception. Kerckhoff (1990:16-19) compares reports from the unique British
National Child Development Study and the US National Longitudinal Study 
(NLS) of the Class of 1972. For British males, he reports a median age of 
home leaving of 21.7, 5.4 years after leaving school and entering a fulltime 
job, but 4 years before first marriage. For US males, the first three transitions 
are much closer: median age left home was 18.3, 0.5 years before leaving 
school and before entering fulltime work, but 4 years before first marriage.
Women in both nations left home earlier and married earlier, but the same 
general pattern exists: Home leaving is a midpoint in the transition to adulthood 
for British women but occurs simultaneously with leaving school and getting 
a fulltime job for US women. 
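
The spacing of transitions reported for males can be laid out with simple arithmetic. The sketch below derives the school-leaving and first-marriage ages implied by the medians and intervals quoted above. It assumes the intervals can be added to and subtracted from the median age of home leaving, which is a simplification, since a difference of medians need not equal the median of differences. The function name and its arguments are illustrative.

```python
def implied_ages(home_leaving, years_after_school, years_before_marriage):
    """Return implied median ages for school leaving and first marriage.

    years_after_school: years by which home leaving follows school leaving
        (negative if home leaving comes first).
    years_before_marriage: years by which home leaving precedes first marriage.
    """
    return {
        "school": round(home_leaving - years_after_school, 1),
        "home": home_leaving,
        "marriage": round(home_leaving + years_before_marriage, 1),
    }


# Figures for males as reported in the text (Kerckhoff 1990).
british = implied_ages(21.7, 5.4, 4.0)
us = implied_ages(18.3, -0.5, 4.0)

print(british)
print(us)
```

On this reading, British males leave school at about 16.3 and marry at about 25.7, while US males leave school at about 18.8 and marry at about 22.3.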

The omission of coresidency by most students of the transition to adulthood 
may reflect everyday life perspectives. Barber's (1989) interviews with a small 
number of families suggest that neither parents nor children see moving out 
as an important marker of adulthood. Jones (1987) concludes that adult status 
is determined by marriage and family formation, with leaving home more often 
associated with adolescent than adult status. Obviously, we need to know more 
about the role coresidence plays in the transition to adulthood: empirically, 
normatively, conceptually. 
Macro-Structural Perspectives


The growth in nonfamily living among young adults may be seen within the 
larger historical pattern of growing individualism and reduced importance of 
family living; other indicators of these are elderly living alone, divorce, and 
cohabitation (Goldscheider & Waite 1991). Although some scholars attribute 
the change to secularization (Lesthaeghe 1983), others emphasize the role 
played by the affluence and security of contemporary society (see Finch 1989). 
The chief candidates to explain cross-national and historical differences are 
economic, legal, and demographic structures. 


ECONOMIC CHANGE Most life-course theorists at least mention stratification
(e.g. Rossi 1980), and there is general agreement that nonfamily living depends 
on the structure of economic opportunity. The development of a post-industrial 
service economy has had two opposing implications for young people: Avail- 
ability of low-skill, entry-level, and parttime positions has increased the labor- 
force participation rate of young adults, especially young women, but real 
earnings for fulltime, fullyear workers are falling. Between 1979 and 1989, 
for example, the proportion of male fulltime, fullyear workers ages 18-24 
earning "low" wages (below the poverty level for a family of four) jumped 
from 18% to 35% of all fulltime fullyear workers; for women, the change was 
from 29% to 43% (US Bureau of the Census 1992b). 

Although scholars of coresidence are certainly aware that these changes 
have increased the possibility of children's independence while simultaneously 
reducing the standard of living they can achieve independently, change in 
economic structure has not been directly examined in empirical studies of 
coresidence. In part, this omission stems from the individual and short-term 
nature of most empirical work. Even longitudinal studies generally follow a 
single cohort for only a few years. Although surveys measure individual 
unemployment, they cannot easily incorporate local or historical differences 
in macro-employment demand. It would appear, however, that declines in real 
earnings may contribute to the increasing proportion of young adults who 
remain unmarried at the same time as the opportunity to earn independently 
encourages them to leave their parents' home. 


POLITICAL STRUCTURES Mayer & Schwartz (1989) note that many of the 
timing mechanisms that govern the life course are state controlled. Draft laws, 
educational practices, tax laws, welfare and housing policies, and child support 
guidelines make implicit and explicit statements about expected ages of inde- 
pendence. One such signal is found in Pell Grants, which provide tuition 
payments for low-income US college students: Students under 25 are assumed 
to have access to their parents’ resources unless they can prove that they are
independent; after 25, they are assumed to be independent and no proof is 
required. 

Rules for welfare or public housing eligibility may indirectly foster or retard 
coresidence. For example, virtually the only rental housing available in Britain 
is public housing; because it gives preference to married couples and parents, 
unmarried nonparents do not generally have the option of living independently 
(Kiernan 1989). In Australia, on the other hand, 16 and 17 year-olds who 
cannot live with their parents (because of conflict or parental absence) can 
leave home and take their family allowance with them; they are also eligible 
for public housing (Young 1987a).

A European Community commission concerned with forced doubling up of 
young adults and their parents and a parallel rise in youth homelessness has 
concluded that 

Leaving home at an age that is in accord with modern life styles and aspirations
about mobility, independence, and personal development, under conditions that
offer access to adequate accommodation, is a natural part of the process of growing
up and should be recognized as an important social right... [and] housing policy
should be revised towards an increase of opportunities and freedom of choice
(Emmanuel 1989:45, emphasis in original).


Australia's policy regarding child allowances may be interpreted as a small 
step in this direction. Other governments, however, seem more inclined to 
encourage coresidence as a means of making families rather than the state 
responsible for dependents (Burton 1989, Finch 1989). 


DEMOGRAPHIC FACTORS Demographic factors—fertility, mortality, mar- 
riage, and even migration—play a role in coresidence by determining avail- 
ability of kin (DeVos & Palloni 1989). Higher levels of fertility increase the 
likelihood that parents will have a resident child, but decrease the length of 
time that any individual child will remain at home. Because unmarried chil- 
dren are much more likely to reside with parents than married children, lower 
marriage rates are generally paralleled by increases in coresidence. The di- 
vorce of a child enhances the likelihood of coresidence, but the divorce of 
parents decreases it. 
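
The first part of the fertility claim can be illustrated with a simple probability sketch. It assumes, purely for illustration, that each child coresides independently of siblings and with the same probability; neither assumption comes from the studies cited. The second part of the claim, that higher fertility shortens each child's stay, is not modeled here.

```python
def prob_any_resident_child(n_children, p_child):
    """Probability that at least one of n children lives with the parents,
    assuming (hypothetically) independent and identical coresidence chances."""
    return 1.0 - (1.0 - p_child) ** n_children


# Illustrative per-child probability; not an empirical estimate.
for n in (1, 2, 3, 4):
    print(n, round(prob_any_resident_child(n, 0.2), 3))
```

Holding the per-child probability fixed, the chance that parents have some resident child rises with the number of children.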


Exchange Theory 


On the micro level, exchange theory is by far the most common theoretical 
framework. Children are assumed to assess the costs and benefits of living 
with their parents compared to alternative living arrangements and to choose 
the arrangement that offers the most highly valued benefits. Under adverse 
circumstances, strictly economic factors may play the critical role. Boyd & 
Pryor (1989), for example, argue that high housing costs and low wages "trap" 
children in their parents’ home. Although the models can encompass qualitative
factors (such as a desire for privacy and freedom from supervision vs a
preference for built-in companionship and lower responsibility for maintaining 
the infrastructure of daily life), empirical models tend to be restricted to easily 
measured structural factors such as employment and education. 
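
The decision rule that exchange theory assumes can be sketched as a comparison of net benefits across living arrangements. The arrangement names and the numeric valuations below are hypothetical and serve only to show the logic; empirical models estimate such quantities indirectly, if at all.

```python
def choose_arrangement(options):
    """Return the living arrangement with the highest net benefit.

    `options` maps an arrangement name to a (benefits, costs) pair,
    both expressed in the same arbitrary units.
    """
    return max(options, key=lambda name: options[name][0] - options[name][1])


# Hypothetical valuations for one young adult (illustrative only).
options = {
    "with_parents": (6.0, 2.0),    # low housing cost vs. less privacy
    "alone": (8.0, 7.0),           # privacy and freedom vs. high housing cost
    "with_roommates": (7.0, 4.5),
}

print(choose_arrangement(options))
```

With these assumed values, high housing costs leave the parental home as the best option, which is the sense in which Boyd & Pryor describe children as "trapped."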

The cost/benefit perspective is less often applied to parental decisions. An 
exception is Avery and associates (1992), who conclude that, especially in the 
later stages of nest leaving, better-off parents subsidize their children's depar- 
ture in order to achieve greater privacy. 

Exchange theory has some obvious limitations. Empirically, age is the 
strongest predictor of home leaving, and cost-benefit considerations may be 
constrained by the fact that leaving home is part of normal developmental 
processes. The microeconomic perspective used in studies of home leaving 
can also be faulted for its extreme individualism. Although there is some 
recognition that parents' and children's goals may conflict (Goldscheider & 
Goldscheider 1989b), microeconomic frameworks have generally failed to put
children's choices in the context of other (generally more powerful) actors or 
to put coresidence into equity or justice frameworks. While some research 
takes the parents’ point of view and some takes the child's, there is little 
research that focuses on the negotiation of coresidence. An exception is Mc- 
Elroy (cited in Nelissen 1990), who considers coresidence decisions within a 
bargaining model that includes the utility functions of parents and youth 
simultaneously. 


DIFFERENTIALS IN HOME LEAVING AND 
RETURNING: EMPIRICAL FINDINGS 


Because all but a minority of children eventually leave home, home-leaving 
research tends to concentrate on timing, differentials, and destination choices 
rather than the fact of home leaving. The empirical findings from this research 
are organized below by major structural differentials. Methodological and 
conceptual issues in defining home leaving are reviewed first. 


Who Is a Home Leaver? Methodological and Conceptual 
Issues 


For parents and children as well as researchers, defining the point at which a 
child has left home is ambiguous. Are we measuring first home leaving or last 
home leaving? Because nearly half of young people leave more than once, the 
choice is an important one (Young 1989). Differentials that are strong in the 
case of early or first home leaving may be largely eliminated when we look 
at final home leaving. The clearest example of confusion over operational 
definitions is the long-standing difference between the Current Population 
Survey and the Census in how to count college students (noted in Goldscheider
& DaVanzo 1986). The Census uses a de facto rule and counts college students 
as absent; the Current Population Survey, which uses a de jure rule, counts 
them as members of their parents' household. 
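
The two measurement choices described above (first versus last home leaving, and de facto versus de jure counting) can be made concrete with a short sketch. The Python below is illustrative only: the `Spell` record, its field names, and the sample history are hypothetical and are not drawn from the Census, the Current Population Survey, or any study cited here.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Spell:
    """One residence spell in a hypothetical history (ages in years)."""
    start_age: float
    at_parents: bool               # physically living in the parental home
    in_college_dorm: bool = False  # away at school in a dormitory


def coresident(spell: Spell, rule: str) -> bool:
    """de_facto: count people where they actually live.
    de_jure: count dormitory residents as members of the parental household."""
    if rule == "de_facto":
        return spell.at_parents
    if rule == "de_jure":
        return spell.at_parents or spell.in_college_dorm
    raise ValueError(f"unknown rule: {rule}")


def leaving_ages(history: List[Spell], rule: str) -> Tuple[Optional[float], Optional[float]]:
    """Return (age at first home leaving, age at last home leaving)."""
    exits = [
        cur.start_age
        for prev, cur in zip(history, history[1:])
        if coresident(prev, rule) and not coresident(cur, rule)
    ]
    if not exits:
        return (None, None)
    return (exits[0], exits[-1])


# Hypothetical respondent: dormitory at 18, back home at 22, out again at 24.
history = [
    Spell(0, at_parents=True),
    Spell(18, at_parents=False, in_college_dorm=True),
    Spell(22, at_parents=True),
    Spell(24, at_parents=False),
]
print(leaving_ages(history, "de_facto"))  # (18, 24)
print(leaving_ages(history, "de_jure"))   # (24, 24)
```

Under the de facto rule this respondent is a returner whose first and last departures are six years apart; under the de jure rule the same history shows a single departure at 24.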

A substantial number of researchers have adopted the term semi-autonomy 
to cover this grey area. Semi-autonomy is a term developed by Katz & Davey 
(1978) to describe situations where young adults are away from the control 
and supervision of their parents but under the authority and supervision of 
some other agency (e.g. in another family's household as servants, in a super- 
vised boarding house, in college dormitories, or in military barracks). The term 
has been extensively used in studies of coresidence to distinguish young adults 
who are away at college or in the service (Young 1987a, Goldscheider & 
DaVanzo 1989). Whether college students and members of the armed forces 
are treated as coresident is largely an issue for those who focus on early 
homeleaving. Using data on white youths from the Detroit Area Study, Thornton 
and associates (1993) report that although only 6% of all two-week periods 
between ages 18 and 24 were spent in group quarters, nearly half of their 
sample spent some time in such quarters at age 18. 

Some scholars are interested in the degree to which young adults living 
away from their parents are independent (a continuous, multidimensional vari- 
able), while others are interested in coresidence (a simple dichotomy). These 
are obviously separate constructs, and the relationship between them is an 
empirical question. If the focus is on coresidence, then college students and service 
members should be treated as leavers and their parents as empty-nesters. 


Race/Ethnic Differentials 


A rather extensive literature shows that African Americans and Hispanics are 
substantially more likely than non-Hispanic whites to live in extended families 
at all ages and marital statuses (Tienda & Angel 1982, Beck & Beck 1984, 
1989). For example, Hogan and associates (1990) show that, among unmarried 
mothers ages 19-26 in 1984, 45% of black and 22% of white women lived 
with their mothers. Focusing on unmarried, childless adults over 25, Hernandes 
(1989) reports that approximately 50% of Black and Hispanic adults lived with 
family members in 1980 compared to 35% of whites. Echoing findings by 
Tienda & Angel (1982), Hernandes reports that Puerto Ricans are more likely 
than other Hispanics to live away from family. Expanding this analysis to 
Asian ethnic groups, Kanjanapan (1989) demonstrates that Vietnamese, Ko- 
rean, Filipino, and Chinese young people over 18 are substantially more likely, 
but Japanese Americans are less likely, to live with family than are whites. 
“For all groups, there is a consistent and powerful [negative] relationship 
between the extended family living arrangement and duration of residence in 
the United States" (1989:52). 

Generally, research on home leaving and home returning is consistent with 
the differentials reported above. Goldscheider & DaVanzo (1989) show that 
blacks are less likely and Asians much less likely to be home leavers, net of 
parents' and children's resources and community characteristics. The low rate 
of home leaving for blacks is largely attributable to lower likelihood of leaving 
home for marriage. 

Turning to parents' experiences, Aquilino (1990) draws on data from the 
National Survey of Families & Households (NSFH) to show that, among 
parents with at least one child over 19, 54% of Mexican-American parents 
were living with a child, compared to 40% of black and 28% of non-Hispanic 
white parents. Differential probability of having unmarried children accounts 
for most of the racial difference in parents' experience with coresidence (net 
of all control variables, black parents with unmarried children are no more 
likely than white parents with unmarried children to have coresident children) 
but cannot account for the substantially greater likelihood that Hispanic parents 
have coresident children. Aquilino does not control for length of time in this 
country, a variable that, as Kanjanapan (1989) suggests, may help explain a 
cultural preference for coresidence. 

The explanation for race and ethnic difference in coresidence is often con- 
ceptualized as a choice between cultural preference and social structure. For 
black/white differences, evidence supports a structural explanation. In addition 
to Aquilino's evidence showing that race has no effect on parents' experience 
net of controls, a structural interpretation is bolstered by Glick & Lin's (1986) 
evidence of a dramatic reversal in black/white coresidence rates between 1940 
and 1984: in 1940, whites ages 18-29 were 40% more likely than blacks to 
live with their parents, but in 1984 they were 22% less likely. Declining 
economic opportunity and reduced marriage rates for black Americans seem 
more likely explanations of this change than an increase in the preference of 
black parents and children for family living. The higher coresidence rates of 
Hispanic youths, however, are more likely to be at least partially attributable 
to cultural differences. 

Racial differences in coresidence are much stronger among the married than 
the unmarried (Tienda & Angel 1982, Beck & Beck 1989, Hogan et al 1990, 
Trent & Harlan 1990), and it seems likely that other factors may operate 
differently across racial and ethnic groups. Although a few studies look sep- 
arately at blacks and whites and most include race and Hispanic ethnicity as 
control variables, few include interaction effects to test whether the processes 
work similarly. This makes it difficult to test competing explanations of cul- 
tural difference versus social class. Among the groups that we know least about 
are married or childless minority young adults. In the flurry of work comparing 
black and white mothers, childless women and young men have been ne- 
glected. 
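
One way to express such a test, offered as a sketch rather than as the specification used in any study cited here, is a logit model with a product term. The symbols are assumptions introduced for illustration: $p_i$ is the probability that young adult $i$ lives with parents, $B_i$ indicates a black respondent, and $M_i$ indicates a married respondent.

```latex
\[
\log\frac{p_i}{1-p_i} \;=\; \beta_0 + \beta_1 B_i + \beta_2 M_i + \beta_3\,(B_i \times M_i)
\]
```

Under this specification the effect of marriage on the log-odds of coresidence is $\beta_2$ for whites and $\beta_2 + \beta_3$ for blacks, so $\beta_3 = 0$ corresponds to a process that works the same way in both groups.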


Gender Differentials 


Women leave the parental home earlier than men, and they are less likely to 
return home. All or most of these differentials are accounted for by women's 
earlier age at marriage (Goldscheider & DaVanzo 1985, Young 1987b, Da- 
Vanzo & Goldscheider 1990). A counterbalancing factor is that unmarried 
mothers are more likely to live with their parents than unmarried fathers 
(Goldscheider & DaVanzo 1985, Aquilino 1991b). Among both the married 
and the unmarried, males are more likely to return home (DaVanzo & Gold- 
scheider 1990). 

Women's lower age at marriage is the key factor producing their lower rates 
of coresidence, but differences in parent-child relationships also may play a 
role. Several scholars have suggested that girls who stay in the parental home 
may be supervised more closely than boys (Boyd & Pryor 1988, Ward & Spitze 
1992) and expected to do more housework. Although the supervision differential 
remains speculative, Goldscheider & Waite (1991) show that sons contribute 
only 10-20% as much household labor as their sisters. 
"[G]rown sons are being subsidized by their parents both financially and in 
terms of services... [so that] staying home after age 18 seems to provide much 
less benefit for daughters than sons" (1991:149). If the costs and benefits of 
coresidence differ by gender, gender-neutral models may miss important ele- 
ments. 


Class Differentials 


Empirically, the likelihood that unmarried individuals of any age will live 
alone is positively and strongly associated with income, and the historical 
increase in single-person households at all ages is attributable largely to in- 
creasing affluence (Michael et al 1980). These findings are consistent with the 
thesis that privacy and independence are highly valued outcomes, the achieve- 
ment of which waits only on necessary resources. 

The empirical evidence on home leaving is generally consistent with this 
thesis. The majority of studies in the United States and Europe show a clear 
link between children's income and employment status and coresidence: Chil- 
dren with more resources are less apt to live at home (Short & Garner 1990, 
Speare & Avery 1993) and less likely to return home (DaVanzo & Goldschei- 
der 1990). This effect may be weaker in early adulthood when coresidence is 
less selective (Avery et al 1992, Ward et al 1992). 

Children's resources are much more likely than parents' resources to buy 
them independence. A variety of structural factors offset the greater ability of 
better-off parents to purchase independence: their houses are bigger and less 
crowded, their marriages are more likely to be intact, and their children marry 
later. Avery and associates (1992) speculate that higher income parents use 
their resources to prevent children from marrying early, and to encourage them 
to go away to school and live independently, thus washing out an overall social 
class effect. 

Social class may affect the relationship between coresidence and children's 
access to parental resources more than it affects household composition itself. 
For better-off parents and children, residential independence is possible while 
maintaining a relatively large flow of resources from parents to children. 
Among the working and lower class, however, coresidence may be necessary 
if children are to have access to parents’ resources. Thus the connection 
between coresidence and flow of resources is probably stronger as one goes 
down the social class ladder. 


Family Structure 


Research from the United States, Canada, Britain, and Australia demonstrates 
consistent effects of childhood family structure: Those with more siblings 
(Young 1987b, Mitchell et al 1989, Kerckhoff & Macrae 1992) or stepfamilies 
and other forms of nonintact families (Mitchell et al 1989, Aquilino 1991a, 
Kiernan 1992) leave earlier. Nonintact families may be a more important 
deterrent to coresidence at younger than at older ages (Jones 1987, Aquilino 
1990). 

The earlier home leaving of stepchildren has been attributed to lower cohe- 
sion in stepfamilies (White & Booth 1985). This supposition is confirmed by 
findings that stepchildren more often cite family conflict as the reason for 
home leaving (Kiernan 1992) and that stepparents and stepchildren are less 
likely than those from intact families to expect children to live at home until 
marriage (Goldscheider & Goldscheider 1987a). The effect of stepfamily status 
on early home leaving appears to be strongest for whites and Asians and 
weakest for African Americans (Goldscheider & Goldscheider 1993) and 
stronger for daughters than sons (Aquilino 1991a, Goldscheider & Goldschei- 
der 1993). 

Two studies suggest that the general finding that children from single-parent 
families leave earlier may be qualified. Aquilino (1991a) shows that US chil- 
dren growing up with a never-married single parent are not significantly more 
likely to leave early than children from intact families, and Kiernan’s (1992) 
British data show an effect for young women but not young men. Looked at 
from the parents’ perspective, however, the picture is clear: currently divorced 
parents (especially fathers) are less likely to have coresident children (Aquilino 
1991b, Ward et al 1992). 

The child's own family structure also has an impact. All studies confirm 
that married children are much less likely to live at home and less likely to 
return home, though this effect is apparently stronger for non-Hispanic whites 
than for Blacks. The effect of parenthood is less clear. Relying on evidence 
from the NLS high school class of 1972 observed from 1972 to 1979, Goldscheider 
& DaVanzo (1985:559) conclude that “for each sex and marital 
status—with the exception of single males—young adults who are themselves 
parents are more likely to be residentially dependent on their own parents than 
are comparable nonparents." This finding has not been replicated. Several 
studies document that presence of own children reduces unmarried children's 
coresidence (Avery et al 1992, Ward et al 1992) and decreases the likelihood 
of home returning (DaVanzo & Goldscheider 1990). If true, this sets up an 
interesting dynamic with race: Black women are more likely to be unmarried, 
which increases likelihood of coresidence, but more likely to be parents, which 
decreases it. Thus, we may find rather different processes contributing to 
coresidence by race. 


Home Returning 


Roughly half of children return home for at least a brief period after their initial 
leaving. Goldscheider & DaVanzo (1986) report figures of 55% for school 
leavers and 64% for service leavers. Kerckhoff & Macrae (1992) show that 
approximately one third of British homeleavers return. From Australia, Young 
(1989) reports that 51% of men and 40% of women return at least once. 

Returning is most likely in the first year or two after initial home leaving. 
Young (1989) reports a mean length of time between initial home leaving and 
first home returning of 2 years and an average time between returning and 
leaving again of 2.5 years, giving a 4-5 year launching period. Compared to 
those who leave only once, returners' first home leaving occurs at an earlier 
age, but their final home leaving occurs at a later age. 

From all three countries, we find the same pattern for home returners: Those 
who are married, older, have been away a longer time, or from larger or 
nonintact families are less likely to return. Young women are less likely than 
young men to be returners, a difference almost entirely accounted for by the 
fact that women are more likely to be married. In the most complete analysis 
of home returners, Goldscheider & DaVanzo (1986) report that probability of 
home return is inversely related to probability of having left in the first place. 
Those who are older, female, still in school, have higher income, are married 
or parents, come from a nonintact family, lived a shorter time in their parents' 
community, have been away longer, or are non-Hispanic white are significantly 
less likely to return. Those who leave military service, school, employment, 
or marriage are more likely to return home. Data from Australia (Young 1989) 
and Britain (Kerckhoff 1990) demonstrate that those who leave on account of 
family conflict have a relatively high rate of returning home, largely because 
they are likely to have left prematurely. Young reports that the subsequent 
home leaving is also often sparked by conflict, "suggesting a cycle of stormy 
departures and unsuccessful attempts at living independently" (1987a:10). 

Is home returning a sign of weakness, as the evidence about unemployment 
and divorce suggests? DaVanzo & Goldscheider (1990) examine this hypoth- 
esis and conclude that role transitions rather than role failures are most often 
associated with returning home. They make a distinction between a "safety 
net" image (implying a slip) and a "home base" image (implying a normal 
base of operations). This conclusion should probably be restricted to the early 
adult years covered in their study. Children who are coresident after age 25 
are more likely to be selected for problems (Grigsby & McGowan 1986, Speare 
& Avery 1993). Following a group of black adolescent mothers from 1967 to 
1983, Furstenberg et al (1987) report a negative association between years 
lived with parents in the first five years after the (grand)child's birth and the 
young mother's economic circumstances 17 years later. They conclude that 
“those who remain or return to the parental household may be the less suc- 
cessful members of our study" (1987:57). 


Summary 


A very substantial amount of parent-child coresidence characterizes the young 
adult years. This declines sharply with age, but approximately 10% of unmarried 
children are still living at home in middle age (Speare & Avery 1993). 
Among the youngest adults (18-24), nearly half of those who live with their 
parents are attending school. After 25, however, only a small minority of 
coresident children are still attending school. Some are unemployed, some are 
low earners, and perhaps 8-9% have physical or mental health problems and 
require assistance with activities of daily living (Speare & Avery 1993). For 
many at all ages, however, home may simply be “cheaper, friendlier, and more 
comfortable" (Young 1989:182) than the alternatives. 


PARENTS' AND CHILDREN'S PERCEPTIONS OF 
CORESIDENCE 


Why are children living with their parents, and how does it work out for parents 
and children? The gilded cage/feathered nest question posed by Avery and 
associates (1992) implies disjunctive combinations: parents bribe unwilling 
children to stay home or lazy children sponge off unwilling parents. In this 
section, parents' and children's perceptions of coresidence and the launching 
process are reviewed. 


Parental Assessments of Coresidence 


Research is quite clear in demonstrating that parent-child coresidence— 
whether the child is 5 or 50—is largely for the child's benefit rather than the 
parent's (Aquilino 1990, Ward et al 1992, Speare & Avery 1993). Although 
some adult children are caring for dependent parents, “Parental dependency 
cannot be assumed, at any point in the life course, to be a causal factor in the 
decision of parents and children to share a home" (Aquilino 1990:410). In the 
majority of cases, children live in parents' households rather than vice versa; 
on the average, parents living with adult children do three quarters of the 
housework and contribute 80% of household income (Grigsby 1989). 

Despite the continued dependency often signalled by coresidence, Aquilino 
(1991b) concludes that parents generally have positive assessments of coresid- 
ence. This is partly a selection effect: prolonged coresidence is likely only 
when parents and children have a harmonious relationship and are part of an 
intact family. When NSFH respondents with children ages 19-34 living in the 
household were asked to give a 7-point rating to the statement, "Taking all 
things together, how does having your son/daughter living here work out for 
you?", 70% responded with a 6 or a 7. Nevertheless, 72% reported at least 
some open disagreements in the last year and a full quarter reported having 
disagreements once a week (Aquilino & Supple 1991). 

Several scholars have demonstrated that parents' relationships with their 
children—regardless of where children live—are significantly related to the 
child’s economic dependence (Aquilino 1991b, Aquilino & Supple 1991, Kulis 
1992). The issue is not whether children pay room and board, but whether they 
depend on their parents for day-to-day expenses (Aquilino 1991b). The extent 
to which coresidence is perceived as unfairly burdensome by parents seems 
to depend much less on whether the child is too old than on whether the child 
is too dependent (Schnaiberg & Goldenberg 1989). 

Aquilino (1991b) finds a negative association between indicators of social 
class and parents’ satisfaction with coresidence, though parents’ education is 
more important than their income in predicting dissatisfaction. Part of the 
reason may be the greater expectations that middle-class parents and children 
have for independence (Goldscheider & Goldscheider 1989a, Schnaiberg & 
Goldenberg 1989). In addition, middle-class children are likely to cost more 
and contribute less. Certainly, evidence suggests that adult children are more 
likely to make an economic contribution in low-income families (Tienda & 
Angel 1982, Grigsby 1989, Goldscheider & Goldscheider 1991). 

Studies assessing parents’ well-being (Aquilino 1991b) and relationships 
with their children (Aquilino & Supple 1991) show that coresident children 
are more likely to be a negative factor for black parents, for better-educated 
parents, when the child has been married, brings grandchildren into the home, 
or is unemployed. No effect was observed for the child's gender or, surprisingly, 
for coming from a stepfamily. The relatively rare cases where adult children 
continue to live in stepfamilies are likely to be selected for unusually good 
parent-child relationships (Aquilino 1991b). Aquilino (1991b) concludes that 
the effect of coresidence on parents’ well-being depends more on which 
children are living at home than on whether there are children at home. 

Parents are stressed when their children have problems, and this seems 
especially likely if the children live at home (Pillemer & Suitor 1991). Aquilino 
(1991b) and Aquilino & Supple (1991) have shown that coresidence is less 
satisfactory if children have problems such as divorce and unemployment. 
Although we have no documentation, we can expect that the coresidence of 
children with other kinds of problems—mental and physical illness and sub- 
stance abuse—is also likely to be less satisfactory for parents. 

How does the coresidence experience differ for parents with returning chil- 
dren? Judging from popular accounts and results from a very small sample 
(Clemens & Axelson 1985), parents are often dismayed by the return of their 
children. How this affects day-to-day relationships in the reconstructed house- 
hold is unclear. Do parents and children negotiate new authority and household 
labor structures, or do they fall back into childhood habits? Schnaiberg & 
Goldenberg (1989) liken children’s returning (or simply staying past the stage 
when parents expected them gone) to a state of Mertonian anomie, suggesting 
that parents may respond by falling into states of retreatism or ritualism. If 
this is so, adult children may be in the household but not of it. The very low 
levels of household chores that coresident adult children are reported to do 
(e.g. Ward et al 1992) certainly suggest that they may not be very well 
integrated into the household. Although people who live with others are better 
off than those who live alone on a wide variety of dimensions, a tantalizing 
suggestion that young adults living with their parents report more loneliness 
than young adults living alone (cited in Clemens & Axelson 1985) underscores 
the need for more information about interaction in parent-child households. 

Alice Rossi (1968) has described four stages of parenthood: anticipation, 
honeymoon, plateau, and disengagement. She suggests (p. 30) that disengage- 
ment is not marked by “any specific act but is an attenuated process of 
termination with little cultural prescription about when the authority and ob- 
ligation of a parent end.” The link between coresidence, home leaving, and 
disengagement would be an instructive study. 


Children’s Perspectives on Coresidence/Launching 


Relatively less research has focused on asking how adult children cope with 
coresidence and home leaving. Although the 1987-1988 NSFH study included 
questions for young people comparable to those Aquilino (1991b) has analyzed 
for parents, analysis of children’s satisfaction has yet to appear. Young (1987b) 
reports that children with more negative attitudes toward living at home are 
more likely to leave home, but she does not provide any information about 
levels of or predictors of children’s satisfaction. 

Shehan & Dwyer (1989) asked college students to list the advantages and 
disadvantages of coresidence for themselves and their parents. They report that 
students focused on the inconvenience to themselves and failed to identify 
problems parents might experience. Children's assessments of their parents' 
benefits from coresidence were more perceptive: Between one quarter and one 
third identified companionship as a benefit parents derive from having 
a coresident child, figures strikingly similar to those reported by Aquilino 
(1991b) for the percentage of parents who report a positive effect of coresid- 
ence on companionship. Shehan & Dwyer's students reported that the primary 
benefits for themselves were financial assistance, emotional support, and com- 
panionship. 

Hartung & Sweeney (1991) interviewed a handful of coresident adult chil- 
dren and asked them directly whether they were living at home because they 
couldn't afford to establish their own household or because they didn't want 
to forego their parents' standard of living. They concluded that most of the 
children could afford to live independently, but not well and independently. 
The adult children they interviewed were willing to forego some independence 
and tolerate some restrictions in order to have more luxuries. 

Analysis of attitudinal data from the Monitoring the Future study suggests 
that Hartung & Sweeney's respondents may be representative of their gener- 
ation. Comparing 1977 and 1986 high school seniors, Crimmins and associates 
(1991) document large increases in the proportion of young people who think 
it is important or very important to have "lots of money" and consumer goods 
such as the latest fashions, a good stereo system, and a new car. Crimmins 
and associates (1992:130) refer to this as an “intergenerational taste effect,” 
and argue that, as the luxuries of one generation have become necessities of 
the next, the minimum level of earnings necessary to leave the parental home 
has risen. 

If this is true, it raises more questions than it answers—because the propor- 
tion of unmarried young adults living at home has actually decreased. Among 
the explanations for this paradox are that parents’ desire for independence has 
grown even more rapidly than their children’s desire for goods, so that parents 
subsidize nonresident children more heavily than before or that rising aspira- 
tions dampen marriage rather than independent living. 


The Launching Phase of Parents’ Lives 


The launching phase of the family life cycle minimally stretches from the time 
the first child leaves home for the first time to the time when the last child 
leaves home for the last time. In the case of a parent with two children closely 
spaced, and if the son is the oldest, the whole launching phase may be com- 
pressed into a few months or a year. On the other hand, if there are many 
children or they are widely spaced and if the young adults are slow to marry, 
launching can easily extend to cover 10 or 20 years (Young 1989). 
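
The definition above reduces to simple arithmetic once each child's departures are dated. The sketch below is a minimal illustration, assuming a hypothetical input format (the parent's age at each of a child's home leavings, in order); the function name and the example figures are invented and are not taken from Young (1989).

```python
from typing import Dict, List, Tuple


def launching_phase(departures: Dict[str, List[float]]) -> Tuple[float, float, float]:
    """Return (start, end, length) of the launching phase.

    `departures` maps each child to the parent's age at every one of that
    child's home leavings, in chronological order. The phase runs from the
    earliest first leaving to the latest final leaving.
    """
    if not departures or any(len(ages) == 0 for ages in departures.values()):
        raise ValueError("every child needs at least one recorded departure")
    start = min(ages[0] for ages in departures.values())
    end = max(ages[-1] for ages in departures.values())
    return start, end, end - start


# Two closely spaced children who each leave once: a compressed phase.
print(launching_phase({"son": [46.0], "daughter": [46.5]}))

# Three widely spaced children, one of whom returns and leaves again.
print(launching_phase({"first": [44.0, 49.0], "second": [50.0], "third": [58.0]}))
```

Because the phase is bounded by the earliest first departure and the latest final departure, a single returning child lengthens it only when that child's final departure is the last one in the family.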

Those who must wait longer to achieve the empty nest are those who have 
more children, more sons, more unmarried children, an intact first marriage, 
other relatives or nonrelatives in the household, a higher income, or are of 
African or Mexican American descent (Aquilino 1990). In intact marriages, 
the empty nest occurs later in men's lives than women's. In the case of divorce, 
however, fathers experience much less coresidence (Aquilino 1990) or indeed 
any interaction with their adult children (Cooney & Uhlenberg 1990). 

Personal observation suggests a widespread delusion about the length of 
parenting obligations. New parents often speak of 18 years of obligation, and 
even Barber (1989) sets out as typical a scenario in which the empty nest is 
achieved 25 years after marriage and 20 years after the last child is born. Yet 
Young's data show median ages of 55-59 for the last child leaving home for 
the last time [a figure consistent with Mayer & Schwartz' (1989) German 
data], and US data show 13-15% of the over-65 still having coresident children. 
Schnaiberg & Goldenberg (1992) comment that the post-war generation 
of parents expected to experience more intensive parenting but for a shorter 
period of time than did previous generations of parents. This unfulfilled ex- 
pectation probably has more to do with current impressions of unwarranted 
levels of coresidence than any change in actual coresidence rates. 


WHY STUDY CORESIDENCE? 


Coresidence is a relatively simple matter of household composition, and critics 
charge that it tells us little about relationships. Exchange and support "may be 
close and intensive even though parents and children do not live together any 
longer, and, conversely, they may be very weak even though co-residence 
continues" (Mayer & Schwartz 1989:148). If the link between coresidence and 
these variables of interest is so weak, why study coresidence? 

Information on household structure cannot substitute for knowledge of fam- 
ily process, but it is important in its own right. Living arrangements create or 
retard opportunities for exchange, cooperation, and conflict (Kertzer 1986), 
and it would be very surprising if a positive correlation was not observed 
between coresidence and nearly all types of interaction. It must be acknowl- 
edged, however, that there is little empirical evidence. 

In one of the few studies linking residence to exchange, Rossi & Rossi 
(1990) report that amount of exchange between parents and children is much 
less for pairs that live apart than together, with a bigger difference in amount 
of aid children give to parents than in what parents give to children. The 
relationship between coresidence and exchange may differ by social class. 
Certainly coresidence is a form of social support more common among African 
American than white families (Hogan et al 1990), and it seems plausible that 
coresidence is a better indicator of instrumental exchange as we go down the 
social class ladder. 

Despite its omission in studies of the transition to adulthood, the end of 
coresidence has obvious implications for parents’ and children's lives. Matras 
(1990:233) suggests that 


“[T]he critical shift associated with the end of co-residence in the parental household 
is the end, or at least the very substantial cutback, in parental authority over 
the child. Continued economic support, advice, cooperation, healthcare, and emo- 
tional support very frequently take place... [but] departure of children from the 
parental household introduces and legitimates a new dimension of voluntarism, 
on the one hand, and diminished accountability, on the other, into the child-parent 
relationships and interaction." 


The end of day-to-day supervision and the onset of nonfamily living are 
stressed by Waite and associates (1986) and Goldscheider & Waite (1991). 
These consequences may be greater for young women, whose lives have 
traditionally been more highly supervised and family bound than young men's. 
Goldscheider & Waite (1991:68) argue that residential independence gives 
young women a taste of freedom that may decrease their likelihood of mar- 
riage, while residential independence predisposes young men to value more 
the services that families (i.e. women) generally provide and thus increases 
their odds of marrying. 


Early Homeleaving 


Special concern has been focused on the minority of young people who leave 
home earlier than usual. Early home leaving may simply be an intervening 
variable in a chain linking childhood disadvantage to adult disadvantage (Mit- 
chell et al 1989), but early home leaving may have independent negative effects 
(Kerckhoff & Macrae 1992). Among the negative effects hypothesized are: 
more emphasis on employment than education and lack of financial backing 
for early adult transitions (Mitchell et al 1989); earlier (and less successful) 
marriages and cohabitations; earlier (and maybe less successful) work lives 
(Aquilino 1991a); and reduced family control over work and marital decisions 
(Goldscheider & Goldscheider 1987). There is, however, almost no data to 
support concern over early home leaving as an independent determinant of 
adult outcomes. 

The consequences of early home leaving seem likely to depend on age at 
home leaving, destination, and parental support after departure. Unfortunately, 
research on early home leaving tends to lump together all respondents who 
leave before age 19 without distinguishing 14-year-old runaways from 
18-year-old high school graduates who leave their parents' home but not their 
protection. The flow of aid from parents to children at the launching stage 
plays a key role in the intergenerational transmission of status, and early 
severance of the tie between parent and child would certainly interfere with 
status attainment. Until we distinguish independence from the end of coresidence, 
it will be impossible to determine the consequences of age at home 
leaving. 


DIRECTIONS FOR FUTURE RESEARCH 


The literature on coresidence is distinguished by sophisticated analytic tech- 
niques and large, national data sets. A substantial portion of the research uses 
longitudinal data sets that track young adults relatively frequently over periods 
as long as a decade. It is limited, however, by a narrow concern for co-residence 
rather than the processes for which coresidence may set the stage. It is also 
limited by a focus on children who came of age between 1972 and 1986, 
children born to the earliest-marrying and thus earliest-leaving cohort perhaps 
in western history. How these children and parents cope with continued dependency 
may thus be unusual, since two decades of dramatic change in economic 
opportunity and marriage behavior separate the current generation of parents 
and their young adult children. In any case, the focus on a single cohort virtually excludes 
concern for macro-structural factors. 

Hagestad (1987) has commented on the alpha and omega of family research: 
we know a lot about childhood and old age and relatively little about the 
in-between years. Coresidence is important, but it is now time to move on to 
investigating processes for which coresidence may set the stage. Among the 
issues needing to be addressed: 

* Authority patterns and exchange relationships among young adults and 
their parents and their linkage to coresidence. What is the flow of money, 
services, and support between parents and children? What are patterns of 
supervision and authority? How does coresidence relate to these patterns? Is 
early home leaving the same as early independence, or—as implicitly assumed 
in the transition to adulthood literature—is living arrangement relatively 
unimportant? 

* Consequences of early home leaving for children. Research on early home 
leaving assumes negative consequences for early home leavers, yet the effects 
of early home leaving on the status attainment process or the transition to 
adulthood remain almost totally undocumented. 

* Race and ethnic differences in coresidence and home leaving experiences. 
Because of strong differentials in likelihood of coresidence, evidence that some 
predictors of coresidence interact with race, and an important debate over the 
relative supportiveness of African American, Anglo, and Hispanic families, 
this issue deserves further research. 

* Norms and expectations for parent-child relationships in the early adult 
years. The widespread illusion that children are staying longer when there 
has been little change in overall coresidence patterns and actually a sharp 
decline in the coresidence of unmarried children suggests that expectations 
have changed more rapidly than behavior. On the other hand, if coresident 
children's household contributions have declined and their expectations of 
support have risen, they may not only seem to be more of a burden, they may 
actually be more burdensome. 

Although coresidence is important in itself, we need to dig deeper into the 
quality of the relationships implied. Some of the unanswered questions noted 
in this review can be answered using secondary data sets already available, 
but others require more detailed and probing questions than are likely to appear 
on an omnibus social survey. Qualitative and social-psychological studies that 
probe meanings and interpretations would be a useful complement to the 
structural data already existing. 
Abstract 


In this paper we review the empirical legacy that developed out of the theo- 
retical and methodological agendas of class analysis and comparative methods 
of the 1970s and 1980s. Our review is restricted to studies that examine the 
variants in the class structures of the developed capitalist democracies. The 
paper is organized into four main sections. In part one, we examine variations 
in the organization of capital; part two takes up the still little-studied resurgence 
of the petite bourgeoisie and small capital; in part three, we review studies 
that have examined national variations in the size and composition of the new 
middle class; and part four reviews the postindustrial and gendered nature of 
the working class. Our conclusion highlights important labor market develop- 
ments of the 1980s that have largely been missed by conventional class models 
and comments on their significance for the future of empirical research in class 
analysis. 


INTRODUCTION 


During the 1970s, the ghost of Marx returned to dominate sociological debate 
in both Europe and North America. Rare was the student who could pass 
through graduate school untouched by the renaissance in class analysis. Subtle 
differences between the class models of Carchedi, Poulantzas, and Wright, and 
critiques of these "structuralist Marxists” from “neo-Weberians” like Giddens, 
Goldthorpe, and Parkin, provided the fodder for coffee room debate and the 
inevitable source of interrogation at doctoral examinations. 

The methodological departures of the 1970s and 1980s were equally 
striking. If not abandoned, the search for universal laws, standard fare in the 
textbooks of the 1960s, was tempered by a renewed emphasis on the 
historical and the contingent. The universalizing mode shared by both 
modernization theory and "scientific" Marxism increasingly gave way to the 
search for explanation of the multiple trajectories of change and development 
evident in industrial/capitalist societies and countries of the Third World. 
Particular nations were no longer understood merely as an instance of the 
unfolding of the general laws of either capitalism or industrialism. The upshot 
was a renewed emphasis on comparative-historical methods of investigation 
and a shift from general theory to explanation of actual historical sequences 
as a preferred analytical strategy. 

Neither tendency, of course, has gone unchallenged, but our purpose here 
is not to debate the relative merits of these countervailing trends. Rather our 
goal is to examine the empirical legacy that developed out of the theoretical 
and methodological agendas of class analysis and comparative methods in the 
1970s and 1980s. Debates over theory and method are fruitful to the extent 
that they give birth to new questions to guide empirical research. The agenda- 
shaping debates of the 1970s gave birth in the 1980s to a considerable research 
enterprise ranging from in-depth historical case analysis to cross-national com- 
parative surveys. Our focus here lies almost exclusively on those studies that 
examine the variants in the class structures of the developed capitalist democ- 
racies. 

In the body of the paper we highlight what has been learned from this 
endeavor, and our aim is mainly descriptive. In the conclusion, we turn our 
attention to the limits of (now) conventional methods of class analysis. As the 
Fordist world of wages, employment contracts, and social programs began to 
unravel in the 1980s, the inability of both class analysis and conventional 
stratification studies to account for, or even detect, this transition became 
apparent. This failure of the “positional approach" (Parkin 1971) common to 
both traditions indicates that reformulation, both theoretical and methodological, 
is now in order. 


VARIETIES OF BOURGEOISIES IN ADVANCED 
CAPITALISM 


The main arguments that informed studies of the capitalist class during the 
post-Second World War era were in fact formulated in the first half of the 
century. One of these arguments, first developed by Hilferding (1981) and 
later popularized by Lenin ([1917] 1975), concerned the fusion of industrial 
and financial capital to form finance capital. The second agenda-setting argu- 
ment was that of the separation of ownership and control in joint-stock com- 
panies (Berle & Means 1932). Although later generations would debate these 
issues as though they were universal trends in all capitalist economies, both 
arguments emerged within a specific historical context—pre-First World War 
imperial Germany in the first case and the Progressive-era United States in the 
second. As we shall see, the recovery of historical context and insistence on 
national variants proved to be among the major contributions of the research 
generated by these debates. 

Neo-liberals and neo-Marxists came up with surprisingly similar views on 
the "separation of ownership and control," although for very different reasons. 
As one of the leading neoliberal sociologists in the post-Second World War 
period, Dahrendorf (1959) proposed that the emergence of the joint-stock 
company signified the "decomposition of capital" into two distinct parts, 
namely, ownership and control. The most important consequence of this de- 
composition, he further argued, was the shift of the power in production from 
legal owners to managers who do not necessarily derive their "authority" from 
legal ownership. Such arguments abounded on the neo-Marxist side as well. 
According to Poulantzas (1975, p. 18), for example, what mattered was not 
legal ownership but "real economic ownership," by which he meant the “power 
to assign the means of production to given uses and so to dispose of the 
products obtained." Thus, neo-Marxists and neoliberals alike turned on its head 
the classical Marxist thesis which stated that, in newly emergent joint-stock 
companies, the “employer of capital" had become a subordinate of the “owner 
of capital." 

Unlike the debate on separation of ownership and control that emphasized 
divisions within the capitalist class, the finance-capital argument referred to 
those inter-sectoral linkages that make the capitalist class cohesive. Whereas 
the former debate concerned the centralization of control and the diffusion of 
ownership, the latter was about the inter- (and intra-)sectoral centralization of 
capital in general. Empirically minded sociologists, based mainly in Anglo- 
American countries, tried to resolve the issue by investigating the "networks 
of interlocking directorships" in the corporate world (Herman 1981, Mintz & 
Schwartz 1985, Ornstein 1989, Scott 1985, 1986, Useem 1984). Though often 
ahistorical and descriptive, the resultant subindustry of corporate research 
showed that among the advanced capitalist economies of Western Europe, 
North America, and Japan, the extent of corporate interlocking in terms of 
ownership and/or control varied both cross-sectorally and cross-nationally. In 
the remainder of this section, we focus on the comparative political-economy 
literature which relates this cross-national variation in capitalist-class charac- 
teristics in general to divergent patterns of capitalist development and state 
intervention. 


106 MYLES & TUREGUN 


In his Modern Capitalism (1969), Shonfield contrasted the political econo- 
mies of France and the United Kingdom. He found in the French case an etatist 
mode of state intervention in the economy, characterized by dirigisme and 
indicative planning. In contrast, UK liberalism favors government at “arm’s 
length" in economic affairs. Later work refined this dichotomy and added a 
new division. Katzenstein (1985), for example, identified three main political 
forms that contemporary capitalism has taken on: (a) liberalism in the United 
States and the United Kingdom; (b) statism in France and, to an extent, Japan; 
and (c) corporatism in some small states of Europe and, to some extent, 
Germany. Corporatism is distinguished from the other two patterns by an 
ideology of social partnership between labor and capital, a centralized system 
of interest representation, and a "culture of compromise" (see also Katzenstein 
1984, 1987, 1989). 

Zysman (1983) made an important contribution to this typology by high- 
lighting the financial aspect of the divergence in state-economy interaction. 
He distinguished three main “national financial systems”: (a) the capital mar- 
ket-based system in the United States and the United Kingdom; (b) the credit- 
based system with government-administered prices found in France and Japan; 
and (c) the German credit-based system dominated by financial institutions. 
Under the capital market-based system, the allocation of resources is based on 
competitively established prices, and the relationships among banks, industrial 
firms, and the government are "at arm’s length." Whereas the credit-based 
system of France and Japan encourages governments to intervene in industry, 
the German system gives the banks enough market power to have a say in 
industry and to negotiate with governments. 

But even among these types there are significant differences. The major 
divide in liberal political economies is between the United States on the one 
hand, and the United Kingdom, Australia, and New Zealand on the other, with 
Canada somewhere in the middle. In the United Kingdom, a historical "gap" 
has defined relationships between industrial and financial capital. To this day, 
the "City"—a square mile of banks and discount houses—remains as the 
graphic expression of the "hegemonic" power of financial capital within the 
UK capitalist class (Coates 1989, Scott & Griff 1985). 

The replication of the UK commercial-banking pattern in Canada, Australia, 
and New Zealand also resulted in a financial-industrial divide. In these “White 
Dominions," banks adopted a hands-off approach toward industry, which 
as a result was penetrated by foreign capital to varying degrees. In 
Canada, for example, financial-commercial capital identified with a few large 
chartered banks has had no interest, past or present, in industrial financing, 
which has been done instead mainly by US capital particularly in extractive 
and manufacturing industries (Clement 1975, 1977, Naylor 1975). Consequently, 
the capitalist class in Canada was born and evolved fractured along 
both sectoral and national lines (for a critique of this thesis from the vantage 
point of the literature on corporate interlocks, see Brym 1989, Ornstein 1989). 

In the United States, the industrial-financial divide has never been as wide 
as it was in the United Kingdom and its “White Dominions.” As a “late-comer,” 
the United States developed a system of investment banking that accelerated 
industrialization substantially, and the financial and industrial "fractions" of 
the capitalist class were more congenial to each other's concerns than they 
would be under the reign of commercial banks (Mintz & Schwartz 1985, Scott 
1986). Functionally, however, the US capitalist class is more differentiated 
than its UK counterpart. What is often called “managerialism” in the United 
States (Useem 1984), thus, refers to the separation of the function of control 
from that of ownership in the corporate sector (Zeitlin 1989). 

The statism of France, Italy, and Japan is more a matter of regulation than 
of ownership. Although the French and Italian (but not Japanese) states do 
own and operate a number of large holding companies, the main characteristic 
of statism in all three political economies is that governments use their financial 
and administrative means to pick winners and losers within and between 
sectors or even regions—a pattern of selective intervention rooted in the initial 
period of industrialization. Japanese industrialization following the Meiji Res- 
toration was a politicized process (Landes 1965) where, in addition to preparing 
the financial and technical conditions for private investment, the state estab- 
lished model factories later turned over to private entrepreneurs. The capitalist 
class that emerged from this state-led industrialization was internally cohesive 
and externally tied to government regulatory mechanisms, and the state in 
Japan continues today to lead the capitalist class (on the regulatory role of the 
Ministry of International Trade and Industry, see Johnson 1982, Morioka 
1989). 

In France, relationships between the state and private business, in which 
family-owned holding companies are predominant, have conformed for much 
of the post-Second World War period to the etatist pattern with occasional 
ventures into "indicative planning" (Michalet 1974, Swartz 1985). The national 
champions picked by the state pay back their sponsors through pantouflage, 
i.e. the transfer of senior civil servants into top positions in the private sector 
(Marceau 1989). In comparison with France, Italy has a larger state sector 
which, like family-owned big businesses, is organized into holding companies 
(Chiesi 1985, Prodi 1974). Reflecting patterns established during the Risorgimento 
and fascism, the Italian capitalist class is divided both along class (private-state 
and small-big businesses) and regional (north-south) lines (Martinelli & Chiesi 
1989). 

Katzenstein (1985) divides the corporatist political economies into three 
groups, "liberal" (Belgium, the Netherlands, and Switzerland), "social" (Aus- 
tria, Denmark, and Norway), and "mixed" (Sweden), on the basis of the 
strength of the labor movement and the market orientation of the business 
community. Whereas the liberal variant has a relatively weak labor movement 
and a foreign market-oriented business community, the social variant has a 
relatively strong labor movement and a domestic market-oriented business 
community. In the mixed (Swedish) variant, labor is strong and the business 
community is internationally oriented. By Katzenstein's criteria, Germany is 
an instance of mixed corporatism, whereas Denmark is closer to the liberal 
wing. The timing of industrialization is a major factor accounting for this 
divergence: liberal corporatism was forged by early industrializers; and the 
social and mixed patterns, by late industrializers. 

Not unlike the United Kingdom and the settler Dominions, the Netherlands 
and Switzerland, whose state sectors are very small, developed a commercial- 
banking system which divides the financial and industrial functions of capital 
(see Stokman et al 1985a for the Netherlands, Rusterholz 1985 for Switzer- 
land). In Belgium, an early industrializer, development took place under an 
investment-banking system, whose industrial and banking functions would 
later be separated through a holding-company structure (Cuyvers & Meeusen 
1985). In contrast, in the late-comer countries where social corporatism took 
root, the levels of state ownership and industrial-financial fusion have been 
very high. The prime example is Austria where the state sector accounts for 
about one third of nonagricultural GDP (Ziegler et al 1985). 

In the mixed corporatism of Germany and Sweden, the state has only a 
minimal ownership role. Though the Swedish model has recently shown signs 
of unravelling (Ahrne & Clement 1992), both the capitalist and working classes 
have traditionally displayed an unusual degree of compactness (see Katzenstein 
1987, 1989 for Germany, Ohlin 1974 for Sweden). That these main classes 
are equally compact in organizational terms forms the basis of a centralized 
system of representation. One factor accounting for the sectoral and class 
cohesion of the capitalist class in both countries is the investment-banking 
system which developed as a solution to the problems of late industrialization. 

This comparative literature has shown that the universalizing statements of 
the early and mid-twentieth century regarding the sectoral fusion and func- 
tional differentiation of capital were built on, and limited to, the particular 
trajectories of capitalist development. Until recently, safer ground for univer- 
salism has been the fate of the petite bourgeoisie in any such trajectory. 


THE RETURN OF THE PETITE BOURGEOISIE 


For classical sociology, Marxian and Weberian alike, the story of the petite 
bourgeoisie was a foretold one. Independent producers who own and operate 
their own means of production would fight a losing battle under the conditions 
of an industrial economy. The predicted gradual erosion of the petite bourgeoisie’s 
economic bases was attributed to the increasing polarization of property 
relations, in Marx's words, the concentration and centralization of capital. 
The petite bourgeoisie was a phenomenon of the past, not the future. As a 
sociologist trying to bridge Marx and Weber, Mills (1951, pp. 14, 28) aptly 
summarized classical sociology's stance on the issue when he declared that 
the centralization of property, ending the "union of property and work," 
sounded the death-knell of the old middle classes which had been “clogging 
the wheels of progress.” 

The resounding performance of the advanced capitalist economies, espe- 
cially after the Second World War, took more and more people out of the 
ranks of the rural and urban petite bourgeoisie into the ranks of both the 
working class and the new middle class. Consequently, the petit-bourgeois 
share of the labor force declined, though variably across the nations, during 
most of the postwar period. For both Weberians (Bechhofer & Elliott 1968, 
1976, 1978, 1985) and Marxists (Burris 1980), this development marked the 
removal of the petite bourgeoisie from the main axis of class relations in 
advanced capitalist societies. 

However, toward the close of the 1970s with the postwar boom at an end, 
sociologists of various persuasions began to rethink the traditional view and 
to question long-established conceptions on the historical trajectory and social 
position of the petite bourgeoisie including their marginality to capitalism, 
technological backwardness, economic inefficiency, and uninterrupted decline 
(Curran & Burrows 1986, Scase & Goffee 1982). A central theme of the 
“reformed” perspective was the ideological, political, and economic functions 
of small commodity production in the reproduction of advanced monopoly 
capitalism. This would accordingly set the research agenda in the late 1970s 
and early 1980s. 

Those who saw contemporary economies as divided into the "traditional" 
and "modern" sectors concentrated on the mechanisms of articulation and 
integration between the two. In her studies of Italy and France, the only major 
European economies which contain a sizable petit-bourgeois population, Ber- 
ger (1980, 1981) tried to make sense of the resilience of the small-scale 
traditional sector in terms of its functions for the large-scale modern sector— 
the production of high-demand goods that large firms do not produce, the 
cushioning of the economic impact of hard times on the modern sector by 
reducing unemployment, and the provision of flexibility to rigid (modern) economies 
through subcontracting. Political factors were also given a role. 
The Gaullist parties in France and Christian Democrats in Italy have histori- 
cally been protective of the class of small producers for both short-term 
electoral and long-term alliance purposes (see also Weiss 1984 for Italy). 

Portraying the petite bourgeoisie as a passive resource readily used by the 
larger forces of the society is not obviously a radical break with the traditional 
perspective. But now the self-induced dynamics of small production were also 
highlighted. In the case of France, for instance, beginning with the Poujadist 
movement of artisans and shopkeepers in the 1950s, the petite bourgeoisie 
succeeded in extracting a number of financial concessions from governments. 
And due to the historical prestige they have enjoyed among consumers, French 
artisanal bakers were able to defeat “industrial bread” at the market level 
(Bertaux & Bertaux-Wiame 1981). Another example of successful petit-bour- 
geois mobilization against government policies was the mushrooming of right- 
wing populist parties in Scandinavia (except Sweden) during the late 1960s 
and early 1970s. Propelled by a sense of the erosion of the petite bourgeoisie's 
economic bases, these parties sought and managed to influence the policies of 
the mainstream parties (Hoff 1982). 

The 1980s proved to be a turning point in the petite bourgeoisie's (mis-)for- 
tune, both socially and academically. Neoconservative governments which 
came to office with an unambiguous anti-labor and pro-business agenda tar- 
geted the small business sector as a “test case” of their market-based initiatives 
in policy and, most conspicuously, ideology. It was against this political 
background that a new generation of studies on the petite bourgeoisie began 
to appear. The central theme of the new stream of research was the arrest of 
the decline in self-employment or, more arguably, the resurgence of the petite 
bourgeoisie. And implicitly, this amounts to nothing less than a claim that the 
centralization, if not concentration, of capital has come to an end. 

In the past two decades the decline of the “old” middle class and small 
capital in general has been reversed in many capitalist economies. Two trends 
demonstrate this fact. The first is a rise in self-employment in countries as 
diverse as Canada, the United States, the United Kingdom, Italy, Belgium, 
Finland, and Ireland (Steinmetz & Wright 1989, OECD 1992). Until the 
mid-1980s, truly dramatic growth rates in self-employment levels were limited 
to the Anglo-American democracies—Australia, Canada, the United States, 
and the United Kingdom—all of which embarked on ambitious programs of 
neoliberal economic restructuring that favored the growth of small enterprise. 
But since the mid-1980s, even social democratic Sweden has witnessed a 
remarkable rise in self-employment from 4.5% of the nonagricultural labor 
force in 1985 to 7% in 1990 and among men from 6.1 to 10.2% (OECD 1992). 
Until the 1980s, rapid employment growth in the public sector had been the 
foundation of Swedish full employment policy (Esping-Andersen 1990). Since 
then, greater emphasis has been placed on local employment programs and 
small business start-ups, a strategy that has enhanced the space for small-scale 
capital in Sweden (Johanisson 1989). 

The second trend illustrating the changing fortunes of small capital is the 
rising share of employment in small firms and a decline in average firm size. 
By the mid-1980s the OECD (1985) had noted declining firm size in Austria, 
Belgium, France and Japan, and when only manufacturing is considered, in 
Denmark, Luxembourg and the United Kingdom as well. By the end of the 
1980s, substantial downsizing had been noted in both Canada (Wannell 1991) 
and the United States (Granovetter 1984, Davis & Haltiwanger 1989, Bryn- 
jolfsson et al 1989). 

These developments suggest nothing less than a reversal of the long-term 
trend toward the centralization of capital and a resurgence of the petite bour- 
geoisie and small capital. Not surprisingly, reactions to these developments 
have been cautious. In Britain, for example, many of those who are officially 
classified as self-employed do not meet the usual criteria of self-employment, 
such as autonomy in the workplace or the ownership of the means of produc- 
tion; they are in fact contractual workers (Dale 1986). In the United States, a 
significant portion of the officially self-employed (especially in service occu- 
pations) experience subproletarian conditions in terms of income, social secu- 
rity, skill, and autonomy (Linder & Houghton 1990). Since official class- 
ification methods are not able to make an unambiguous distinction between 
the self-employed and employees, corporations may register employees as 
self-employed to avoid taxes and social security payments. Nevertheless, as 
Steinmetz & Wright (1990) observe, self-employment growth in incorporated 
businesses—the most likely site of real petite bourgeois expansion—has been 
more rapid than in unincorporated businesses where the marginal self-em- 
ployed are likely to be found. 

Progress in identifying the causes of this reversal and its national variants 
has scarcely begun. Traditionally, self-employment has grown as a short-term 
cyclical response to recession and high unemployment. National studies sug- 
gest that this is the case in some industries, such as construction trades, but 
overall the counter-cyclical effects of increasing unemployment on self-em- 
ployment have been declining (Sharpe 1983, Steinmetz & Wright 1989). 
Rainnie (1985) offers a classical Marxist explanation for this “relative retreat” 
of the process of concentration and centralization. When large firms found 
themselves in a crisis situation during the late 1970s, the argument goes, they 
began to experiment to avoid high wages, union militancy, and the high costs 
of innovation, by fragmenting some of their operations into smaller units and 
by “contracting out” others to small firms. However, it has been pointed out 
that this corporate strategy of downsizing and devolvement (e.g. franchising, 
licensing, and subcontracting) can account for only part of the small-business 
revival (Shutt & Whittington 1987). 

In the now abundant literature on post-Fordism (e.g. Piore & Sabel 1984), 
the resurgence of small capital is associated with the end of the Fordist model 
of mass production. The emergent and increasingly successful technologies, 
it is argued, combine skilled labor with general purpose machinery to produce 
small batches for specialized markets (Brusco 1982, Brusco & Sabel 1981). 
Economies of scope, not of scale, are now the order of the day (but see Murray 
1983, 1987). The shift to services—especially consumer and business ser- 
vices—is a contributing factor. But again it is clear from national studies that 
this accounts for only a small share of the total shift in self-employment 
(Steinmetz & Wright 1989) and declining firm size (Wannell 1991). Brynjolfs- 
son et al (1989) show that increased investments in information technologies 
result in a decrease in vertical integration and greater reliance on market 
transactions to obtain inputs and distribute outputs. In short, the reduced 
transaction costs brought about by new technologies bring a shift from “hier- 
archies" to "markets." 

If the causes of this development are ambiguous, the consequences for 
workers are more apparent. Those employed in small firms tend to have lower 
wages and fewer benefits, are less likely to be unionized, and have higher 
turnover rates—issues to which we return in our conclusion. 


IN SEARCH OF THE NEW MIDDLE CLASS 


Few questions in modern sociology are as disputed as those surrounding the 
“new middle class.” Who is in it? What are its boundaries? What is its future? 
In postwar sociology, the dividing line was frequently drawn between white 
collar and blue collar workers using conventional and relatively simple occu-
pational classifications. The “working class” was composed of factory workers,
truck drivers, and other blue collar wage earners who did “manual” work.
Managers, school teachers, and even clerks and secretaries—“non-manual”
employees who worked for salaries rather than wages and in offices and stores 
rather than on the shop floor—made up the new and expanding middle class. 
For Mills, White Collar provided a more than adequate title for his major study 
of middle class life in postwar America. 

With the passage of time, the boundaries of the new middle class were 
conceptually whittled away. The armies of (largely female) clerks, secretaries, 
retail sales and service workers are no longer accorded “middle class” status 
by the majority of either conventional or Marxist sociologists (but see Gagliani 
1981). By virtue of their typical earnings, job requirements, and position in 
the “relations of ruling” of the contemporary enterprise, they are more likely 
to be counted among the lower echelons of a new postindustrial proletariat 
instead. At the other end of the spectrum, those corporate executives 
who exercise traditional entrepreneurial functions of investment and allocation 
(strategic decision-making, to use A Chandler’s terms) are now typically 
counted among those who exercise “real economic ownership,” that is, as a
fraction of the ruling class or bourgeoisie. 

By the 1970s virtually all class theorists—Marxist and Weberian—had 
converged on the centrality of two broad strata for understanding the class
structure of advanced capitalist societies: the growing army of mid-level cor- 
porate officials engaged in the day-to-day administration of the modern firm 
on the one hand, and the professional and technical "knowledge workers" who 
have become virtually synonymous with postindustrialism, on the other. Just 
how to treat the latter group was a matter of considerable contention. For the 
Ehrenreichs (Ehrenreich & Ehrenreich 1979), these employees became part of 
the “professional-managerial class,” for Goldthorpe (1982) part of the “service 
class,” and for Poulantzas (1975), part of the “new petite bourgeoisie.” In 
Wright’s (1978) initial class schema, the new “knowledge workers” outside 
the traditional axis of managerial authority were allocated to a “contradictory 
class location” —semiautonomous workers—while in his later version (Wright 
1985), the credentials of the “expert class” are seen as a new basis of class 
exploitation. 

Irrespective of theoretical predilection, both groups had to be taken seri- 
ously if for no other reason than numbers: together, “new middle class” 
administrative, professional, and related occupations made up between a 
quarter and a third of the labor force in the developed capitalist economies 
by the beginning of the 1980s. And despite the plethora of case studies 
showing processes of “deskilling” in particular occupations and the expansion 
of low wage “junk jobs” in the service sector, “new middle class” positions 
continued to grow disproportionately to the rest of the economy through the 
1970s and 1980s. 

In what could be counted as a “first generation” of comparative studies to 
discuss national differences in the size and growth of the new middle class, 
authors such as Burris (1980), Livingstone (1983), Carboni (1984), and Orn- 
stein (1983) followed the expedient of adjusting occupational distributions, 
routinely published by the International Labour Organization, to approximate
a definition of the “new middle class.” These studies pointed to the fact that 
the United States and the United Kingdom, countries that “have occupied a 
dominant position within the capitalist world economy” (Burris 1980, p. 167), 
were also where the new middle class had attained the most significant dimen- 
sions. Just as the decline of the old middle class was associated with the 
transition from a feudal-agrarian to a capitalist-industrial economy, the rise of
the new middle class could be understood as “the product of developmental 
tendencies specific to the advanced stage of monopoly capitalism" (Burris 
1980, p. 164). Especially notable was the large size of the managerial strata. 
Carboni's estimates (1984, p. 136) showed the United States and Britain with 
10.5% and 9.4% of the labor force in management occupations, respectively,
compared to 3% to 5% in most other countries at the end of the 1970s. Sweden,
a late industrializer, was also notable for its relatively large new middle 
class—indeed the largest in Burris's estimates, but this was entirely due to the
large number of professional and technical employees. This was a result, as 
Burris argued (1980, p. 172), of "four decades of Social Democratic rule" 
which had created a labor market dominated by health, education, and other 
state welfare services. 

These authors offered a variety of plausible interpretations for the patterns 
they observed but lacked any data to evaluate their claims. Drawing on Hy- 
mer's (1972) classic essay, Burris argued that the high percentage of managers 
in the United States labor force reflects the concentration of the administrative, 
planning, and research functions in the metropolitan center of an American-dominated
multinational economy (see also Johnson 1982). For Livingstone
(1983, p. 62), the fact that Canada had fewer managers than the United States 
reflected Canada's position as a "secondary imperialist formation" and net 
importer of foreign capital. For Burris and Carboni, Great Britain's relatively 
large managerial strata were a residue of both its imperial past and its continued
importance as an international financial center. 

In contrast, Bowles et al (1983) argued that America’s high administrative 
overhead is largely a result of American forms of labor control that emphasize 
intensive supervision and regulation of workers. In the United States, they 
argue, the accelerated growth of the administrative component of the American 
economy after the Second World War reflects increasing intensity of supervi- 
sion over American workers, a response by employers to growing resistance 
on the shop floor. 

Esping-Andersen (1990) added a distinct "postindustrial" twist to these 
views. Because the public welfare state is underdeveloped in the United 
States, the corporate welfare system must compensate: More managers are 
required to administer the huge and complex fringe benefit programs that 
constitute America's private "welfare state." And because the United States 
lacks a well developed system of labor exchanges and worker-training 
institutions, American firms require very large personnel departments to 
recruit and train labor. 

To adjudicate among these interpretations with occupational data was 
impossible for the simple reason that occupational codes tell us little about 
what people actually do in their jobs. A minimal requirement for understand- 
ing why American workers appear to be "overmanaged," for example, is 
knowledge of what so-called managers actually do when they manage. Are 
they engaged in making high-level policy decisions (Burris), regulating 
workers (Bowles et al) or administering corporate welfare and training 
programs (Esping-Andersen)? 

Many of these difficulties were overcome by a "second generation" of 
comparative studies initiated by EO Wright at the University of Wisconsin 
which, rather than relying on the vagaries of occupational titles, used a complex 
set of direct behavioral measures to determine what people do in their jobs.¹
In addition to the usual self-reported occupational titles, respondents in the 
Comparative Class Structure Surveys (CCSS) were asked detailed questions 
about their involvement in organizational decision-making, the forms of au- 
thority they exercise over other employees, and their place in the organizational 
hierarchy (see Wright, Costello, Hachen and Sprague 1982). 

The advantage of directly measuring what people do in their jobs was
illustrated by one of the first studies to emerge from this set of cross-national 
surveys. In a comparison of Sweden and the United States, Ahrne & Wright 
(1983) demonstrated that the American labor force was indeed “overmanaged” 
not because America has more senior executives administering the American 
multinational empire, but because American employers hire more labor to 
supervise and regulate the activity of other workers. In a follow-up study that 
included Canada, Black & Myles (1986) were able to show that this American 
pattern—the “American Way”—was also characteristic of those sectors of the 
Canadian economy traditionally controlled by American capital and organized 
by American unions, namely, manufacturing and resource extraction. In finan- 
cial, social, and other services, the class organization of the Canadian work- 
place resembles that of Sweden. 

These studies also demonstrated that the other side of the “middle class”—
Wright’s “semi-autonomous employees”—were proportionately twice as nu-
merous in Sweden as in the United States, but not simply or even mainly
because of Sweden's large public sector and developed welfare state. Rather, 
employers in all sectors of the Swedish economy employ disproportionately 
more “knowledge workers,”² and such employees are less likely to be involved
in exercising authority over other employees. The United States does have a 
large "new middle class" by international standards, but the reason for this lies 
in its distinctive "industrial relations" system, not its position in the world 


¹Designed explicitly to measure national differences in class structure, a three-country data file
(including the United States, Sweden, and Finland) became available in the early 1980s, followed 
by a five-country file (adding Canada and Norway) and at the end of the decade a 10-country file 
(now adding Great Britain, the former West Germany, Australia, Denmark, and Japan). These data 
are well known and have been described in detail elsewhere. A useful introduction to the project 
can be found in Wright (1985) and to the debates it engendered, in Wright (1989). 

²Those familiar with the debates on Wright’s class typology will be aware of the especially
contested nature of his category of “semi-autonomous” workers. Two points were at issue: first, the 
theoretical status of the category as a separate (contradictory) class location; and second, the 
measurement properties of the procedure used to identify such jobs. As a result of a special 
supplement to the Canadian version of the survey, Clement & Myles (1994, Appendix II) were able 
to demonstrate that the latter concern is unwarranted. The procedure used to identify 
semi-autonomous employees in Wright's original survey provides a valid, indeed rigorous, 
procedure for identifying “knowledge workers,” a category of workers characterized by high levels 
of self-direction and jobs with high levels of cognitive requirements. 
economy or because of its underdeveloped welfare state. Indeed, contrary to
the evidence derived from ILO occupational data, Clement & Myles (1994) 
have shown that as one narrows the definition of what is meant by a manager 
to include only higher level executive positions, national differences virtually 
disappear. Even with Wright's broader definition of management, Holtmann 
& Strasser (1990) show a rather narrow range of variation, with managers 
making up between 12.2% and 14.8% of the employed labor force.

Despite improvements in measurement quality, the Comparative Class 
Structure Surveys quickly proved limited in their ability to account for the 
differences they identified. Because of their cross-sectional design, explanation 
was limited to differences that could be accounted for by compositional dif- 
ferences in industry mix or the level of state employment. Such avenues were 
quickly exhausted because country-specific patterns tend to dominate mere 
compositional effects. The result, as in the first generation of studies, was 
reversion to plausible but ad hoc historical interpretations for which there was 
little direct evidence. Australia, for example, has as many supervisory employ- 
ees as the United States and even more managers (policy-makers), a fact 
attributed to an historical attraction to "bureaucratic" solutions to economic 
problems (Boreham et al 1989). Great Britain and the United States show the 
greatest resemblance in their class structure (as the ILO data also suggest) 
despite radically different patterns of economic growth and development in 
the twentieth century (Holtmann & Strasser 1990, p. 19). 


POSTINDUSTRIALISM, WOMEN AND THE WORKING 
CLASS 


One of the oddities of the class debates of the 1970s and early 1980s is that 
theoretically there was more concern with drawing class boundaries (who was 
in the middle class, who was not) than with explaining the results once the 
boundaries were drawn. Several consequences followed from this preoccupa- 
tion. First, once the data were gathered, there was little in the way of theory 
to guide empirical investigations of class structures. Second, there was even 
less concern with national differences and changes in the composition of the 
working class, which for the most part was defined as a residual category once
nonworking (and especially "new middle") class positions were identified. 
During the 1980s, however, all this began to change. Under the rubrics of 
“post-Fordism” and “post-industrialism,” attention returned squarely to issues 
related to the dynamics of class structure in general and its consequences for 
the working class in particular. In this context, the question of gender as a 
class issue could no longer be ignored. 

The stereotypical worker who emerges from the pages of the history of 
industrial capitalism was a male, blue-collared worker employed in the pro-
duction of goods (mining, manufacturing, construction) and their distribution
(railways, trucking, shipping). He was typically employed by a capitalist firm, 
not the state, and was the raw material from which the labor movement was 
formed. The relative and now absolute decline of the industrial (male) working
class is a result of changes both in the material division of labor and in the
social composition of the people employed by capital. The first of these 
changes has to do with the shift of employment from the production of things 
to the production of services; the second is a result of the massive entry of 
women into the paid labor force. 

Drawing on Baumol (1967), Esping-Andersen (1993) postulates three po- 
tential outcomes for the working class of the “postindustrial” economies. As 
high productivity in manufacturing releases labor, the service sector's capacity 
to generate new jobs is limited because of low productivity growth. The first 
possibility, as a result, is a rise in unemployment, declining labor force par- 
ticipation, and the growth of an "outsider" class of welfare state clients. A 
second possibility is government subsidized employment, primarily in the form 
of welfare state jobs in health, education, and social services. The third pos- 
sibility is that service employment will expand in the private sector because 
of low wages that correspond to productivity differentials between sectors. 

The main consequences of these alternative scenarios will be experienced 
by women who make up the majority of service workers. In the first—low 
employment—scenario, women bear the brunt of low labor participation rates;
in the government-led wage subsidy model, they will have high participation 
rates but also high levels of job segregation in social services; where low wages 
are the main source of service growth, they will be concentrated in low paid 
personal services. 

Welfare states and labor unions play key roles in shaping which outcome 
is likely to occur. Service intensive welfare states, such as Sweden, will tend 
to the high employment government subsidy model; transfer intensive welfare 
states such as Germany will tend toward the low employment, “outsider class”
variant. In countries such as Canada and the United States with weak labor 
unions and less developed welfare states, the low wage, high employment 
model can be anticipated. 

Using reconstructed micro-data from national censuses, Esping-Ander-
sen (1993) is able to show a rough correspondence between his theoretical
expectations and the occupational and industry composition of the working 
class for six countries. Germany, a transfer-intensive welfare state, has had 
much slower service sector growth than elsewhere, comparatively few un- 
skilled workers in either goods or services, low female participation rates, and 
a very large “outsider class.” Sweden and Norway have high levels of service 
employment as a result of a large, unskilled, and predominantly female work 
force in the welfare state and high levels of sex segregation. North America's 
“flexible” (i.e. low wage) labor markets also have high levels of service
employment, and a highly developed, low wage, consumer service sector. 
Britain, Esping-Andersen concludes, straddles the German and North Ameri- 
can models: slow to postindustrialize, but biased toward the low skill, low 
wage American consumer service model. 

But the implications of all this for the future of the working class are less 
than clear. Country-specific studies included in the Esping-Andersen volume 
show there is little indication that a new “postindustrial” proletariat is taking 
form except in Britain. Unskilled service employment in both North America 
and Scandinavia mainly provides entry level jobs for the young who typically 
move on rather quickly to more traditional “working class” occupations: blue
collar jobs (for men) and clerical employment (for women). Germany
has low mobility out of unskilled service work but comparatively few unskilled 
service jobs. Only in Britain is there a large, relatively stable, unskilled, and 
postindustrial proletariat emerging.


THE FUTURE OF CLASS ANALYSIS 


As class analysts, whether Marxist (Wright 1985) or Weberian (Erikson & 
Goldthorpe 1992), were developing new data bases appropriate to comparative 
studies of class structure and class mobility during the 1980s, the real world 
passed them by. Irrespective of the variety of methods and typologies these 
authors adopted, all reached the same conclusions concerning emergent trends 
in class structure: Traditional proletarian jobs in manufacturing were declining 
and the shift to services was dominated not by low-end fast food workers but 
by high end managerial, professional, and technical occupations. To put mat- 
ters simply, all of these studies converged on the claim that “good” (read 
“middle class”) jobs were on the rise and “bad” (proletarian) jobs were in 
decline (Block 1990, Boyd 1990, Esping-Andersen 1990, 1993, Goldthorpe 
1987, Mayer & Carroll 1987, Myles 1988, Wright & Martin 1987). By virtually 
all of the conventional sociological yardsticks, emergent trends in Western 
labor markets appeared to be taking us toward a postindustrial Nirvana and 
the demise if not the disappearance of the “working class” as we know it. As 
Esping-Andersen (1993, p. 225) concludes: “There is little doubt that the kind 
of polarization depicted by traditional class theory has lost much of its face 
validity.” 

Outside of the small world of sociology, however, matters looked very 
different. As Levy (1988) showed for the United States, patterns of upward 
economic (not occupational) mobility had declined dramatically since the 
1960s both between and within generations (for Canada, see Myles, Picot & 
Wannell 1993). The rapid growth in real living standards that characterized 
the post-war boom and created Goldthorpe’s famous affluent worker (better 
known in popular North American parlance as the “middle class”) had come
to an end. During the 1980s, attention turned to the "declining middle class" 
(Kuttner 1983), a result of growing polarization in wages and earnings in 
Europe, North America, and Australia (Gottschalk & Joyce 1991, Gottschalk
1993) and, to a more modest extent, welfare state cutbacks in some countries
(Fritzell 1992). Downsizing, contracting out, expanded use of “just-in-time” 
and temporary workers began to affect not just low wage service personnel 
but professionals and mid-level managers as well. 

Gottschalk & Joyce (1991), using the Luxembourg Income Study data base
for the 1980s, show rising inequality and polarization in wages and salaries for
Australia, Canada, France, Germany, the Netherlands, Sweden, the United King-
dom, and the United States. All countries showed below average growth or
declines in wages in the lower deciles and above average growth (or below 
average declines) for the upper deciles. In a five country study from the same 
data base, Fritzell (1992) shows that growing wage inequality (the main source 
of growing inequality in family income) was exacerbated by changes in the 
tax-transfer system (welfare state cuts) in Sweden, the United Kingdom, and 
the United States, but largely offset by welfare state transfers in Canada and 
Germany. The most dramatic shift in the earnings profile occurred in the United 
Kingdom where the “middle” (those earning between 50% and 150% of
median earnings) declined from 62% to 43%. 

All of this came as a great surprise to economists who had become accus- 
tomed to the stable patterns in the structure of income inequality that prevailed 
from the end of the Second World War until the 1970s (Danziger & Gottschalk
1993, p. 3). Progress in identifying the causes for this shift has been slow, 
however, more a matter of chipping away at prevailing popular explanations 
and eliminating theories (deindustrialization, demographic shifts) that are in-
consistent with the data (see Danziger & Gottschalk 1993, Gottschalk & Joyce
1991). Only in the case of Sweden is there a clearly identifiable "smoking 
gun," namely, the breakdown of centralized wage bargaining after 1983 (Hibbs 
1990, 1991). 

The disjuncture between the conclusions of conventional sociological stud- 
ies of classes and actual trends in wages and other job characteristics since the 
1970s poses a number of theoretical and empirical conundrums. Unlike eco- 
nomic theory in which rewards and life chances are seen to accrue to persons 
and their human “capital,” sociologists have always insisted that rewards are 
attached to positions (Parkin 1971, p. 13). The class system or the structure of 
inequality is defined by the distribution of positions, and the only remaining 
issue is to identify the process by which persons are recruited to fill these 
positions. For Parkin (1971, p. 18): "The backbone of the class structure, and 
indeed of the entire reward system of modern Western society, is the occupa- 
tional order." Marxisant versions of this "positional" account differ only by 
virtue of the way “positions” in the structure of empty places are defined. But
as Levy (1988) clearly shows, occupational mobility is a poor indicator of 
economic mobility either intergenerationally or over the life course. And one 
of the earliest lessons learned from the "declining middle" research is that 
changes in occupational structure (the distribution of positions) are a poor 
guide to changes in the distribution of wages and earnings (compare Rosenthal
1985 with McMahon & Tschetter 1986). 

The failure of class-based positional models to identify key shifts in eco- 
nomic mobility and the structure of inequality is especially problematic for 
so-called neo-Weberian class models (e.g. Goldthorpe 1987) in which “life 
chances" figure prominently in the definition of classes and there is a pre- 
sumption that "classes" cause inequality. Neo-Marxist orthodoxy has generally 
insisted on separating the definition of classes defined by production relations 
from class definitions that include living standards (the sphere of consump- 
tion). But the general failure of neo-Marxist class models (Wright & Martin 
1987) to detect the polarization of the 1970s and 1980s and instead to insist 
along with everyone else that trends in class structure point to a general 
deproletarianization of labor in advanced capitalist economies must pose some 
theoretical discomfort. As it has turned out, Braverman's (1974) scenario of 
class polarization was correct at the level of consumption, not at the level of 
production relations as he anticipated. And ironically, it is a former consultant
to the Republican Party, K. Phillips (1993), who has built on these develop- 
ments to provide the more provocative account of the historical implications 
of America's shifting “class structure.” 

Postwar North America and then Europe saw the emergence of a high-wage 
working class protected by public, corporate, and union policies to stabilize 
both wages and employment. The result was to eliminate for many the "cycle 
of poverty" (Rowntree) traditionally associated with the working class life 
course. For workers, Rowntree's life cycle of poverty was replaced with what 
might be called the “Fordist life cycle” (Myles 1990) characterized by relative
stability of both jobs and income over the ups and downs of the business cycle 
and the individual life course. Since the 1970s, the search for labor flexibility 
has begun to unravel postwar employment relations and with them have gone 
the class and stratification implications that sociologists once felt secure read- 
ing off occupational hierarchies and class maps. For the most part, wage 
polarization and growing labor market insecurity have grown within, not 
between classes, during the 1980s. As a result, the familiar positional ap- 
proach—the "backbone" of empirical class analysis—has lost some of its 
analytical power for explaining much that is consequential to class theory, 
whether Marxian or Weberian. This hardly means that class analysis is dead. 
Instead, it calls for new analytical tools to capture the class character of late 
twentieth century capitalisms. 
Abstract


There is a growing realization in academic and policy circles that the double 
transition to democracy and the market may not be as simple or as easy as 
first imagined. Beginning with a discussion of the relationship between eco- 
nomic policy and regime type, this review emphasizes the problems associated 
with the uncertainty of the double transformation. It then suggests three mech- 
anisms discussed in the literature as contributing to a more stable transition. 
Some authors have emphasized the importance of social and political contracts. 
Others contend that the transition must be imposed by the domination of a 
class or the state. A third group has focused on the rise of a new civil society 
and the need for some form of trust between the various groups. The review 
concludes with a call for more field level research and a greater emphasis on 
perceptions and expectations of change. 


INTRODUCTION 


In the 1990s capitalism and democracy appear triumphant. The collapse of the 
Berlin Wall supposedly signaled not only the triumph of liberalism, but even 
the end of history (Fukuyama 1992). Yet, despite the success in the democra-
tization of Latin America and, less so, of Eastern Europe, as well as the
conspicuous global ascendancy of market-based policies, many remain appre- 
hensive regarding the long-term viability of the transitions to democracy and 
capitalism. As the euphoria of 1989 passes, academic analysis has increasingly
focused on the possible conflicts between these two goals. This review analyzes 
the dilemmas of the double transition as currently discussed in the social 
science literature.¹ Obviously there are immense differences between a tran-
sition from the communism of Eastern Europe, the neoliberal military author-
itarianism of Latin America, and the authoritarian state-directed capitalism of
East Asia. While attempting to provide some introduction to the relevant 
literatures for each area, I have focussed on the parallels and similarities 
between the cases in the hope that this will promote more useful comparative 
research. 

This article uses a limited definition of democracy that requires the provision 
for participation of all adult members of a community, freedom to formulate 
and advocate political alternatives, and the credible possibility of turnover in 
power (Linz 1978, Bealey 1988, Bobbio 1989, Dahl 1989). Certainly, the 
concept of democracy may be defined much more broadly (Bowles & Gintis 
1986, Bunce & Hicks 1987, Shapiro 1993), but given that even the minimal 
conditions for democratic rule present difficulties, a more exhaustive set of 
criteria would make the issue of transition purely academic. 

The discussion below does not distinguish between economic adjustment or 
stabilization programs such as those directed by the IMF and more long-term 
economic restructuring. The economic transition may involve any or all of the 
following: fiscal austerity, tightening of credit, currency devaluation, trade 
liberalization, wage reductions, and/or privatization (Taylor 1988, Williamson 
1990). While there are obvious differences with regard to the impact and 
requisites for these policies, they share a common goal: fostering rapid eco- 
nomic growth within a market economy. Here I ignore the issue of whether 
such a transition is desirable or whether neoclassical economic analysis is 
appropriate (Murrell 1992, Bresser Pereira et al 1993). I assume, however, that 
such change in the short term will have serious costs for significant segments 
of the population and will generate opposition (Canak 1989, Handelman & 
Baer 1989, Walton & Ragin 1989, Sachs & Lipton 1990). 

In this review, I have favored comparative-historical approaches. Attempts 
to arrive at statistical verification of relationships between political structures 
and economic outcomes have been frustrated by the classic stumbling block 
of inferring causation from simple association. Given the nature of the subject 
matter, large-scale quantitative comparisons also face practically insurmountable
problems of categorization (Lindberg & Devarajan 1993). Labels such as


¹Given the size of the literature which could arguably be included in this essay and the speed
with which it is growing, I have restricted myself to works available in English. I have also focussed
on the more recent work and do not discuss classic treatments of this issue in Marx, Weber, de
Tocqueville, Schumpeter, and Marshall.
democratic or authoritarian are sensitive to individual judgement and anach-
ronistic application. Measures of political legitimacy or stability are even more
difficult. There are also problems with evaluating economic policy. Should 
one measure implementation, consistency, or success? (Bresser Pereira et al 
1993). Focusing on a smaller number of cases with a more varied and complex 
set of variables also offers a better opportunity to study the dynamics of the 
transition process itself. I also do not explicitly address the debate over whether 
the success of the transition is determined by pre-existing socioeconomic 
structures or by agent-determined tactical choices (Kitschelt 1993, Levine 
1988, Share 1987, Evans & Stephens 1989, Rueschemeyer et al 1992, O'Don- 
nell & Schmitter 1986). Such a dichotomy is not only unproductive, but it 
risks neglecting perhaps the most interesting issue: how the different histories 
of the relevant countries help shape the equally important processes of
transition.

This review also emphasizes the interaction between domestic politics and 
economic policy and performance; it pays much less attention to discussions 
of international factors. Obviously, the need to achieve the approval of signif- 
icant transnational actors will play a large role in shaping the policy agenda 
of a regime in transition (Stallings 1992, Kahler 1992, Sachs 1991). Others 
have noted how the international environment can determine both the scale 
and timing of an economic crisis and the political responses toward it (Goure- 
vitch 1986). The role of military defeat in establishing democratic regimes has 
not been appropriately studied despite the fact that nearly half the cases, in 
one count of successful democracies, have arisen from a disastrous war or 
invasion (Therborn 1977). External powers may serve as guarantors of political 
and social peace as well as provide demonstration effects (Diamandoros 1986, 
Di Palma 1990, Herz 1982, Appelbaum & Henderson 1992, Rozman et al 
1992). Whitehead (1986) cautions countries in the process of democratization, 
however, against counting too much on external actors for assistance. The very 
democracy of the developed countries may hinder the transition. For example, 
protection of selected industries or farm products and restrictions on immigra- 
tion may improve the electoral chances of OECD governments, but they will 
not help Eastern Europe. Despite the importance of the international environ- 
ment, the consensus is that the key to a successful transition lies in domestic 
politics and policies (Haggard & Kaufman 1992a, Schmitter 1986).

Finally, the focus of this review is the process of transition. While obviously 
the long-term relationship between democracy and the market is of critical 
significance, I have largely restricted my discussion to the problems facing 
nonconsolidated democracies and recently restructured economies. I have also 
emphasized the analysis of the dual transition rather than the attainment of 
either goal. (See Mainwaring 1992 for a review of democratization, and Hag- 
gard & Kaufman 1992a for a review of economic adjustment.) 

According to many early post-war analyses of the relationship between
democracy and the market, the countries in the process of transition should not
face special difficulties. In this perspective, there is no inherent conflict be- 
tween these two goals and "all good things can go together" (Packenham 1973). 
Jeffrey Sachs, the leading practitioner of economic "shock therapy," for
example, claims that democracy and the market are inextricable (Passell 1993,
p. 60). From a very different perspective, the simultaneous transition may well 
be impossible: "there is a cruel choice between rapid self-sustained expansion
and democratic processes" (Bhagwati 1966, p. 204). According to this view, 
the "liberal model of development avoided the problem of choice [between 
economic and political change] by claiming that all desirable values could be 
maximized. But it has turned out not to be a realistic or relevant choice for 
most modernizing countries" (Huntington & Nelson 1976, p. 26). 

The search for a decisive confirmation of either one of these positions has 
been largely unsuccessful. On the simplest level, we may observe that no 
functioning democracy has a nonmarket economy, but that capitalism does not 
guarantee free and fair elections. Establishing cause and effect, however, has 
proven much more difficult. Lipset (1959) was one of the first in a tradition 
which sought to establish an empirical link between democracy and capitalist 
economic development. This link was confirmed by Cutright (1963), Cutright &
Wiley (1969), and Bollen (1979, 1983). Neubauer (1967) and Jackman (1973) 
challenged the optimistic evolutionary model by postulating threshold effects. 
Marsh (1979) found evidence of a positive association between authoritarian- 
ism and economic growth. Dick (1974) suggested that purely repressive re- 
gimes do not do very well, but that a semi-competitive model could be the 
best alternative. Weede (1983) reported some evidence that democracy tends 
to reduce growth, but only in those cases where the government is actively 
involved in the economy. Haggard & Kaufman (1992b) note the many diffi- 
culties faced by democratic regimes attempting to impose harsh economic 
policies and control inflation, but reject a deterministic coupling of economic 
success and regime type. They suggest that stability and length of tenure may 
be decisive. Stallings & Kaufman (1989) find that authoritarian regimes were 
better at dealing with inflation and cutting budgets, but that transitional regimes 
(in either direction) had the most difficulty with debt payments. Remmer 
(1986b) says that authoritarian regimes do not outperform democracies, nor 
are they more likely to impose structural adjustment programs or to survive 
the political repercussions of these. Bienen & Gersovitz (1985) and Bates & 
Krueger (1993) also fail to find a significant relationship between regime type 
and policy outcomes. (See Remmer 1990, Kohli 1986, and Rueschemeyer et 
al 1992 for more extensive reviews.)

Despite the inconclusive evidence, we may identify three general dilemmas
facing democratic efforts to impose new economic policies. These dilemmas 
include problems involving collective action, distributive allocation, and the 
institutional interests of political groups (Haggard & Kaufman, forthcoming). 
We begin with the assumption that while all social actors may want the political 
benefits of democratic rule and the economic advantages of the market, the 
pursuit of individual preferences will create a situation where both are im- 
possible (Staniland 1985, Bates 1990). Because of distributive demands, de- 
mocracies are seen as incapable of supporting the required capital accumu- 
lation (Staniland 1985, Martin 1977, O’Donnell 1979) and as unable to cope 
with inflation (Maier 1985). According to this perspective, economic logic 
requires that at least some political demands remain momentarily unsatisfied, 
yet democracies have difficulty saying no.

The combination of demands by individual groups and Downsian
behavior on the part of political elites will produce economic policies that are
driven by electoral rationales rather than by productive efficiency (Hibbs 1977, 
Lewis-Beck 1988, Nordhaus 1975). Excessive demands on the democratic- 
capitalist state weaken its capacity to govern (Brittan 1975, Crozier et al 1975, 
Olson 1982) or may lead to fiscal and legitimation crises (Block 1981). Such 
problems will produce the temptation to rid the population of “bad habits” and 
impose an "enlightened central authority... able to guide people to social 
optimality" (Foxley 1983, p. 102). In the end, whether one sees democracy 
endangering capitalist growth (Huntington 1968) or capitalism dooming dem- 
ocratic participation (Lindblom 1977, O'Donnell 1979), some choice between 
them appears inescapable. 

There are several problems with arguments regarding the economic inepti- 
tude of democracy. We may begin by pointing out the inherent anti-populist 
bias of such critiques. Many in the economics profession appear distressingly 
quick to dismiss the benefits of democracy, especially someone else's (Barry 
1985). In much of the literature on the transition to the market, economic 
dysfunctions are generally described by examples of labor's "lavish" ability 
to consume, but the excesses of the rich are accepted as the inevitable costs 
of trickle-down growth (Sheahan 1980). Policies calling for "social order with
economic progress" often neglect to ask "progress for whom?" (Cardoso 1979).
Yet, accumulation in one place may simply lead to growth in another (Taylor 
1988). For example, during the 1980s, the efforts of the Latin American 
countries to squeeze every dollar out of their economies largely benefitted the 
creditor banks who received the net capital flow (Canak 1989). The creation 
of an international financial network and the subsequent ease of capital flight 
mean that money could always leave without providing the investment
supposedly purchased by the tranquility of an authoritarian regime.

Much of the accumulation argument also assumes a development led by 
industry that requires repression of labor. Yet, the dramatic economic successes
of South Korea and Taiwan may be partly attributable to the creation of a 
successful small peasantry (Amsden 1979, 1989, Gold 1986, Berger 1991). 
The post-Mao reforms in China also benefitted from the early opening of 
agriculture (Shirk 1989, Perkins 1988, Prybyla 1989). An early emphasis on 
agriculture may also avoid some of the urban centered political difficulties 
often cited as impediments to a market democracy (Waterbury 1989, Bates 
1981). The model of capital accumulation which dominates the literature may 
also no longer be relevant in a post-industrial economy. Schneider (1987) 
indicates that while a dominant actor may be necessary for the "easy stage" 
of accumulation, a more dynamic model may be best for later complex tech- 
nological and regional development. In his analysis of the socialist systems, 
Kornai (1979, 1980) contends that the institutional mechanisms designed to 
promote initial accumulation retarded growth in Eastern Europe. Attempts to 
follow a version of nineteenth century industrialization may not only doom 
democratic hopes but also condemn the countries to a marginal status in an 
economy increasingly driven by services (Gereffi 1992). 

The organizational structures required to maintain an authoritarian regime 
might serve to impede the economic rationality they are meant to protect. 
Analyses of the Hungarian and Yugoslavian pre-1989 experiments as well
as less dramatic reform efforts in the rest of the ex-Soviet bloc demonstrate
that the weight of the bureaucracy prevented any meaningful improvement 
in economic performance (Bauer 1983, 1988, Ramet 1985, Nee & Stark 
1989, Bukowski 1988, Lampe 1988). The record of authoritarianism in Latin 
America is not much better (Hartlyn & Morley 1986, Malloy & Seligson
1987). Conversely, the political structures of a democracy requiring negoti- 
ation and consultation may both promote better designed policies and support 
more efficient implementation (Bresser Pereira, Maravall, & Przeworski 
1993). 

The contradictions noted between democracy and the market may also 
simply reflect a deeper conflict between political and economic rationales 
(Kohli 1989) not inherently absent in authoritarian regimes. Ames (1987), for 
example, indicates that the authoritarian regime in Brazil was no less prone to 
the political use of pork barrels than its democratic counterparts. When many 
of the communist regimes faced both a legitimacy crisis and an economic 
downturn in the mid-1960s, they were unwilling or unable to impose harsher 
budgetary and fiscal measures. Instead they created an increasingly expensive 
social compact which became more difficult to sustain in the late 1970s and 
early 1980s (Bunce 1990). The Hungarian government, for example, dropped 
plans for more drastic economic reform prior to 1989 partly because of oppo- 
sition from the urban working class (Kovrig 1987). Caught between the de- 
mands of the international banks and those of their populations, many of the 
countries in Eastern Europe reduced their capital investments—precisely the
expected outcome of democratic "pandering" (Comisso 1986). 

In short, authoritarian regimes do not necessarily provide the “hard-budget 
constraint” that may be critical to the economic transition to the market. 

Rather than claiming a particular democratic inability to co-exist with a 
thriving market, we should stress the problems of regulating the political and 
economic uncertainty accompanying the transition (Di Palma 1990). O’Don- 
nell & Schmitter (1986) also emphasize the need to mitigate fears and provide 
assurances. Przeworski (1991) argues that democracies inherently involve 
uncertain outcomes; the same could be said for markets. Throughout the 
discussions of transition one can find innumerable examples of failure caused 
by the inability of one or more actors to accept the uncertainty involved in a 
more competitive environment or their (often well justified) unwillingness to 
accept any situation that would allow a return to the status quo ante or that 
would overly rely on the good faith of historical adversaries. The combination 
of uncertainty with the need to sacrifice economic or political interests (at least 
in the short run) is particularly difficult. Each actor may be willing to do its 
part but still fears that the subsequent effort will only be used by another for 
individual profit. Such apprehensions do not originate in rational game-playing 
scenarios but are shaped by the historical legacies, socioeconomic positions, 
and cultural expectations of the actors involved. The rest of this article reviews 
three mechanisms that have been suggested for dealing with this uncertainty. 


THREE MECHANISMS 


While any system of categorization will do some injustice to the subtleties of 
scholarly argument, I am proposing that the solutions to the dilemma of the 
double transition as discussed in the literature can be understood as involving 
combinations of three social mechanisms: contracts, domination, and trust. 
These coincide with the three foundations suggested by Przeworski (1991) for 
a working democracy: spontaneous and voluntary participation, the existence 
of some policing mechanism, and a moral commitment. To an extent, my 
categorization also parallels the classic triumvirate of economy, polity, and 
society. But I wish to emphasize not so much a spatial or a functional division 
of the society undergoing transition, but the various institutional mechanisms 
available to resolve the dilemmas discussed above, no matter where they might 
appear. 


Contracts 


Contracts involve explicit compromises and compacts that distribute resources 
in an acceptable manner to the relevant social groups. They seek to balance 
promises of reform with guarantees that the status quo will not change past 
prescribed thresholds. They reconcile left and right, past and future (Di Palma
1990). According to Sachs (1990) the failure to create institutions or mecha- 
nisms through which to moderate social conflict accounts for many of the 
problems faced by the Latin American countries. In such situations, the recur- 
ring instability makes it seem that the only logical economic behavior is 
plunder, that the only rational political principle is force (O'Donnell 1988). 

Transition contracts provide what Oliver Williamson (1985) has called
"governance structures." All the actors in a transition are certainly subject to
the problems that Williamson has identified in private transactions. First, asset 
specificity: actors’ options for exit are limited; they will likely be stuck both 
with each other and with whatever new world they create. They are also subject 
to the limitations of bounded rationality: they cannot create a perfect contract 
that will take into account all future events and design corresponding institu- 
tions. Finally, all actors fear opportunistic behavior by the others which will 
harm their interests in the long run. Social or elite contracts seek to lower the
"transaction costs" of the economic, political, and social transitions by
attempting to reduce the risk involved (Hamilton & Woolsey Biggart 1988).

Transition contracts may be embodied in political institutions. Przeworski
(1991) has defined the institutional needs of new democracies managing
economic change: democracies, he argues, last when they can evoke
self-interested, spontaneous compliance from all major political forces. Democracy
has to offer all actors a chance to compete and the possibility of shaping 
decisions. This is made possible through the negotiated construction of insti- 
tutions allowing the players to agree to disagree. If we wish to understand 
transitions, we have to analyze the process through which these institutions 
are created, taking into account their distributional consequences. Przeworski 
notes, however, that the strains of economic restructuring challenge even the 
best designed institutional structures. Geddes (1991) shares some of Prze- 
worski's rational choice methodological perspectives, which she uses to predict 
the likelihood of administrative reform. According to her, change occurs only
when the individual interests of the politicians who must initiate it coincide
with the collective interest of reform. She then demonstrates how different 
electoral structures will produce different propensities for reform. The problem 
with these works is that they tend to disregard the role played by perceptions 
and definitions of equity. Strategic considerations are important, but such 
game-playing takes place in a social environment of expectations and fears. 
Transition bargaining does not occur in a historical vacuum, but reflects pre- 
vious constructions of legitimacy and national identity. We need to pay much 
greater attention to how these determine the flow of the political game.

Other students of institutions (defined more traditionally) or constitutional
contracts have analyzed how the choice of presidential vs parliamentary sys- 
tems and majority vs proportional representation help determine the likelihood 
of a successful transition (Linz 1990b, Linz, Lipset & Horowitz 1990, Lijphart
1991). Linz cautions against the winner-take-all dynamics of
presidentialism and suggests that parliamentary regimes offer greater room for
accommodation. That is, they allow for continued negotiation of the terms
of the transition, thereby reducing opposition fears and helping assure some 
commitment to the continuation of democratic rule. Lijphart suggests that the 
optimal system for political stability is a "hard" version of proportional rep- 
resentation which allows for representation of minority parties, but also places 
limits on the number of viable parties. One problem in this discussion is that 
while these authors emphasize democratic stability, they say little about which 
system will be best able to impose economic change. Again we note the 
possible contradiction between the two transitions. While parliamentary sys- 
tems may allow for stronger democratic consolidation (Stepan & Skach 1993, 
Bermeo 1994), presidential rule may be necessary in order to impose the social 
costs associated with the turn to the market. The varied experience of countries 
that have attempted economic restructuring from Thatcher’s Britain to 
Menem's Argentina indicates the need for either a very strong executive or a 
practically hegemonic majority in the legislature. 

Contracts may also simply involve negotiated pacts within elite circles (Di 
Palma 1990, Karl 1990, Higley & Gunther 1992, Gillespie 1992). It is im- 
possible to define a uniform set of practices that guarantee success in these 
negotiations because each is associated with a particular historical path (Share 
1987). One apparently universal problem is the need to play "coup poker" with
forces that have access to the means of violence and that attempt to obtain the best possible
deal or else withdraw from the game (O'Donnell & Schmitter 1986). Baloyra 
(1987) refers to these as the politics of the end-game, where a new government 
guarantees the integrity and safety of the old authoritarian institutions and 
personnel. If in Latin America these negotiations involved assurances to both 
the military and the bourgeoisie that often supported it, in Eastern Europe these 
require some protection for the nomenklatura. These constraints may frustrate 
those who might seek the creation of a new social order (Petras 1986). 

Analyses that emphasize strategic behavior and political choices, however, 
may neglect the distribution of resources and power. The political and eco- 
nomic regime produced by a transition will reflect not only the process itself, 
but also the pre-existing conditions under which it took place. The circum- 
stances under which elite or institutional contracts are drawn clearly tend to 
favor one set of actors over others. For example, while capital may be able to 
operate under both democratic and autocratic regimes, labor can assert its 
interests only in a democracy. Santiago Carrillo, head of the Spanish Communist
Party in the 1970s, explains the consequences for the political left: "We
must have the courage to explain to the workers that it is better to give a 
surplus to the bourgeoisie than to face an even worse fate" (cited in Przeworski 
1988, p. 74). The most productive analysis of transition contracts may therefore
be that which links them to class structure and the particular needs and per- 
ceptions of different social sectors. Rueschemeyer et al (1992) offer an excel- 
lent example of this approach as they emphasize the critical role played by the 
working class in the development of democracy. They describe how labor 
historically accepted aspects of capitalist control in exchange for democratic 
participation that could provide opportunities to organize and defend working 
class interests. 


Domination 


While it may be theoretically possible to construct a contract situation with 
enough mutual hostages or one that assures adequate distribution of benefits 
so as to guarantee voluntary compliance, it is difficult to imagine a world of 
contracts without means of enforcement. Too much reliance on exchange of 
political favors as the basis for a transition could create “coalitional stalemates” 
where the government could not act without fear of alienating key allies 
(Kaufman 1990, Bates & Krueger 1993). As pointed out by Moore (1965), the 
path to modernity has historically been quite brutal; the road to capitalism and 
democracy is a “valley of tears” (Di Palma 1990) which may require discipline 
and authority (Di Palma 1990, Dahrendorf 1990). 

We may distinguish between two forms of domination in historical transi- 
tions to democracy and capitalism (de Schweinitz 1964, O'Donnell 1978). In 
the first, society is led by a coherent bourgeoisie; in the second, by a government
bureaucracy that has succeeded in fragmenting or coopting private interests.
Either one may serve as the "binding agent" able to define goals, resolve
contradictions, and choose means (Hirschman 1958, Migdal 1988) with which 
to successfully manage the transition. The first alternative is the path followed 
in the initial creation of capitalist democracy in the nineteenth century. The 
model of a bourgeois domination eliminates many of the conflicts inherent in 
the simultaneous development of market and democracy. Since a single class 
can control both politics and economics, it can develop a "class-wide rationality"
to encompass the needs of both spheres. Conversely, the elimination of a
class with the power to derail either political or economic change may also be 
a critical step. Both Amsden (1979) and Gold (1986) emphasize the importance 
to Taiwanese economic development of the destruction of the traditional For- 
mosan oligarchy by both the Japanese colonial government and the KMT. A 
similar displacement of a potentially obstructive class took place in Korea and 
Meiji Japan (Amsden 1989, Berger & Hsin-Huang 1988). Zeitlin (1984) provides
a wonderful historical account of the importance of class coherence and
domination (or lack thereof) in Chile's development. A similar argument
(applied to all of Latin America) may be found in Therborn (1979). In
Eastern Europe, the absence of such a class may actually require that the
nomenklatura assume some of its functions (Ost 1993). 

How relevant is this model? None of the revolutions that produced the 
developed states of Western Europe and of North America originally estab- 
lished what today would be called a democracy. Even into the twentieth century 
large segments of these populations were excluded from voting for reasons of 
sex, class, or race (Therborn 1977, Dahl 1989). The conflict between the 
economic and political imperatives facing contemporary states in Eastern 
Europe and Latin America would certainly be less problematic if a state had 
only to consider the wishes of 8% of the population, as was the case in Britain
in 1867. The lesson here is that a successful transition may require the creation 
of an "exclusionary democracy" (Remmer 1986a). This would allow competition
while limiting participation and liberalize the regime before democratizing
it (Dahl 1971, O'Donnell & Schmitter 1986). Such a strategy may work
for a limited period, but while the interests of a particular class may dominate 
after the transition, it is difficult to imagine a contemporary regime able to 
maintain explicit restrictions on participation without significant repression, 
much less that it could claim legitimacy as a democracy. As discussed below, 
however, such domination may be expressed in cultural forms allowing a 
formal democracy that disguises effective exclusion. 

In the second alternative, the state assumes many of the roles of a dominant 
class. Some suggest that the key to the successful transition is to depoliticize 
the economy by drastically reducing the state's role within it (Srinivasan 1985). 
In the reasoning of North & Thomas (1973), for example, the way to assure 
the coexistence of markets and democracies is to have a minimal state. Yet, 
there is an increasing consensus that "the state establishes the context in which 
markets operate" (Ordeshook 1990, p. 9) and that in the contemporary developing
world, the creation of markets is a political act (Bates 1990, p. 49).
This appreciation of the critical role played by the state continues a tradition 
of recognizing its importance in the early development of capitalism (Polanyi 
1957) and especially in the case of late industrializers (Gerschenkron 1962). 
Despite some significant exceptions (Balassa 1981), for example, most work 
on the East Asian miracles recognizes the critical role played by the state in 
economic development (Deyo 1987, Haggard & Moon 1989, Appelbaum & 
Henderson 1992, Wade 1990). The case of post-Mao China is particularly 
interesting because there we have the phenomenon of a communist version of 
a bureaucratic-authoritarian state creating the conditions for a market revolu- 
tion (Harding 1986, Womack 1984, Perkins 1988). The literature on economic 
reform in Africa has also been placing much more emphasis on the creation
of a strong state with corresponding social controls (Diamond 1988, Callaghy 
1990, Sandbrook 1990). 

What are the crucial attributes of the developmentalist state? The first is 
what Michael Mann (1988) calls infrastructural power or the state's capacity
to actually impose commands. The second we may call autonomy, or the ability 
to issue commands independent of social forces; this allows the state to resolve 
some of the collective action problems discussed by Olson (1982). The best 
sources for the comparative analysis of these issues are the collections edited 
by Haggard & Kaufman (1992a) and by Nelson (1989, 1990); they feature an
explicit focus on policy outcomes and processes in countries undergoing tran- 
sition. Important factors in determining state capacity include elite cohesion, 
creation of a meritocratic bureaucracy, and organizational design of the state 
apparatus (Evans 1989). The quality of political leadership and its ability to 
impose the new political and economic rationale may also be crucial to the 
success of a transition (Hanson Leung 1985, Squires Meany 1986, Centeno 
1994). 

With regard to autonomy, many authors emphasize the need to insulate 
decision makers from the "pull of distributive politics" or social pressures for 
a particular policy alternative (Haggard & Kaufman 1992a, Callaghy 1989, 
Amsden 1979, 1989). But Evans (1990) also cautions against too much insu- 
lation; the state must remain embedded in the society in order to ascertain 
public reaction to policies and the relative success of these. The former Soviet 
Union, for example, represented a case of too much autonomy (Poggi 1990, 
Connor 1988). 

The combination of capacity and autonomy often produces both a repressive 
check on popular demands and an effective control over domestic capital which 
promotes long-term investment and discourages speculation and expatriation. 
Such states do not fit neatly into categorizations such as "laissez faire" or
"welfare state." Instead they promote what Callaghy (1989) calls "embedded
liberalism" in which market and state balance each other and in which eco- 
nomic forces are embedded in political realities. 

Can democracy survive in such states? Far from being an obstacle, a strong 
state may be necessary for a stable democracy (Diamond & Linz 1988). None 
of the attributes discussed above necessarily requires an authoritarian regime. 
The Japanese case demonstrates that a developmentalist state can co-exist with 
many of the characteristics of democracy (even if at times it has skirted its 
definition as such). The role of the Liberal Democratic Party may even suggest 
that under certain circumstances hegemonic parties may provide the best bridge 
with which to balance the political and economic logics of the transition. There
is also considerable evidence of a negative relationship between democracy 
and inequality (Rubinson & Quinlan 1977, Muller 1988), and a democratic 
state must be strong enough to be able to tax the rich sufficiently to provide 
adequate safety nets without resorting to the populist-authoritarian cycle (Sachs 
1990). Certainly in the process of transition, the state must be able to control 
the rapacity of those with initial access to resources if it is to avoid both popular 
disappointment with democracy and the creation of an enclave economy with
little chance for development (Bresser Pereira et al 1993). There is always the 
temptation, however, to postpone democracy indefinitely while the techno- 
cratic guardians perfect a transition to the market (Centeno 1993). 

Finally, we should recall Aristotle's contention that in a democracy the 
people must both be obeyed and learn to obey. Not enough attention has been 
paid to the need to establish an accepted and respected code of law which 
permits citizens not only to safely challenge authority and thereby defend 
democracy, but also to confidently enter into market relationships with each 
other (Kornai 1990, Lewandowski & Szomburg 1989). The creation of a legal 
system may reduce the uncertainty accompanying the transition (Litwack 
1992). A body of law perceived as above particular interests, whose authority 
is beyond question, may help prevent divisive debate, particularly during the 
distribution of state property (Suleiman & Waterbury 1990, Frydman & Rapa- 
czynski 1990, Gruszecki & Winiecki 1991). 

The establishment of a Weberian authority, however, may be a mixed 
blessing for many. For example, Voskamp & Wittke (1991) and Burawoy & 
Krotov (1992) describe the lack of control over labor on the factory floor 
during much of the late Soviet period. The imposition of Western managerial 
techniques and demands may actually lead to a decline in the occupational 
license of industrial workers. The need to respect state authority may also 
frustrate those for whom infractions represented a form of rebellious expres- 
sion. The need to accept new limits on personal freedom and to accede to a 
possibly unfair new social order will place a great deal of strain on the 
transition. The next section discusses those social relationships that might 
provide the glue to keep society together. 


Trust 


Much of the literature on the transition in both Eastern Europe and Latin 
America speaks of a return of "civil society" as critical to both the collapse 
of the authoritarian regime and the creation of a new order (Stepan 1989, 
O'Donnell & Schmitter 1986, Judt 1988, Garton Ash 1990, Goldfarb 1992, 
Pérez-Diaz 1993). Each author has his or her own definition of what is meant 
by "civil society" and the way it contributes to a successful transition. One 
common element is the belief that the success of the systems of contracts and 
domination described above will partly depend on a social atmosphere of 
mutual respect and a consensus on principles of fair play and good faith 
(Lindberg 1985, Bruszt & Stark 1992, North 1981). Transitions may require 
enough solidarity, unity, and sense of common purpose to allow citizens to 
accept not only limitations on their own behavior vis à vis other groups, but 
more importantly, support a faith that other groups share a similar commitment. 
The simultaneous creation of political voices and potential market exits may 
depend on a foundation of loyalty or trust. Giddens (1990) notes that trust in
the “abstract capacities” of a social system is critical for its development. The 
key to much of social order during a transition may be reliability and predict- 
ability based on mutual confidence. 

To an extent, trust is a gift of history and circumstance. Obviously, societies 
with relatively equal distributions of wealth and income may find it easier to 
establish collective goals and assign corresponding responsibilities, making 
both democracy and economic development easier (Katzenstein 1985, Muller 
1988). Such conditions are not given but are the creation of a complicated 
process full of historical contingencies. Research on corporatism and welfare 
states in Europe indicates that the societies in transition cannot expect to 
duplicate such a finely calibrated system of institutional compromises in the 
short term (Berger 1981, Scholten 1986, Cameron 1978, Lange & Garrett 1985, 
Przeworski 1980). 

Societies with homogenous populations and cultures may also more easily 
avoid turning every policy decision into a debate on the distribution of the 
social pie (but we should avoid overemphasizing ethnic divisions and the use 
of vague notions of "tribalism" to explain failure). Many of the post-war 
discussions of political development as well as more recent work in neo- 
modernization would suggest that the underlying cultural values are determi- 
nant (Huntington & Dominguez 1975, Berger & Hsin-Huang 1988, Berger 
1991). We should not assume, however, the permanence of particular attitudes 
or values, but again, we should recognize their historical origins. What are 
often described as Japanese traditions, for example, are really expressions of 
a mythologized political ethos created during the Meiji period (Morishima 
1982, Munakata 1988). 

Cultural arguments must be used with great care as they can easily be 
reduced to often racist stereotypes depicting inflexible and determinant “civ- 
ilizations.” Yet the straw man of modernization theory did have a brain and 
was never the simplistic model often described (Eisenstadt & Rokkan 1973). 
Analyses which “bring culture back in" do not necessarily follow simplistic 
linear-evolutionary models (Holm 1988, Lipset 1988); rather, they discuss the 
interaction between political and economic practices and particular cultures. 
Research in Latin America has strongly challenged the notion that some 
authoritarian cultural traditions made democracy unlikely (Booth & Seligson 
1984, Tiano 1986). Yet, as Kathryn Sikkink (1991) has so ably demonstrated, 
ideas and perceptions of development models' costs and benefits do go far to 
explain economic policy. 

We have less information on the creation of a “market culture." Certainly 
even a short visit to Eastern Europe, and especially the former Soviet Union, 
would suggest that these societies will face particular difficulties. Even if we 
accept the pursuit of individual self-interest as a universal human trait
(questionable at best), capitalism requires much more: the methodical rational
pursuit of wealth under some form of legal structure. Possible cultural hurdles to
both democracy and the creation of a viable market in Eastern Europe include 
the ethos of imposed equality (Bunce 1990) as well as the legacy of lying and 
deception and the manner in which this contributed to the creation of a dys- 
functional civil society (Goldfarb 1989, Jowitt 1992, Kohak 1992). Already 
some research has attempted to define the values of a population in transition 
(Hout et al 1992, McDonough et al 1986), but we need much more. For 
example, I know of no current research on the attitudes of those who oppose 
transition either to democracy or to capitalism. In the case of Eastern Europe, 
it has been too easy to dismiss such efforts as nostalgic or demagogic. 

Research on the cultural foundations of transition might focus less on values 
and more on a deeper layer of frameworks, causal assumptions, and taken-for- 
granted meanings that shape visions of a just society (Swidler 1986, Wuthnow 
1985). Following the work on the new institutionalism in organizational theory, 
we might also emphasize the acquisition of cognitive processes rather than 
socialization (DiMaggio & Powell 1992). We should also begin to pay better 
attention to rhetorical claims by the different actors and use these to ascertain 
shared assumptions about the nature of the society and the perceived direction 
of change (Hirschman 1991). Obviously, we also need to better define the 
interactions between beliefs and practices, and between ideology and social 
structure (Lachmann 1989). Finally, we might also move away from a concern 
with Parsonian dichotomies and emphasize patterns of social co-operation, 
trust, and tolerance. Putnam's work on Italy (1992) reveals that the presence 
or absence of a "civic community" helps explain the ability of governments 
to create a viable social and economic order independent of differences in 
parties, wealth, or urbanity. Inglehart (1988) also presents considerable evi- 
dence of a link between political satisfaction, interpersonal trust, and support 
for the existing social order, and the stability of democracy. 

I believe that the most promising area of research is on what may be called 
the social construction of trust or the creation of legitimacy. The question
should not be whether a particular nation is “ready” for democracy or the 
market, but how a minimal social consensus develops about the rules involved 
in both (Rustow 1970, Linz 1990a, Di Palma 1990). While the historically
given cultural attributes that may be best suited for democracy and the market 
have been extensively studied and debated, less attention has been paid to the 
process of creating the "imagined community" that could manage the transi- 
tion. 

Culturally centered analysis must remain cognizant of the role played by 
the distribution of resources in the creation of a new civil society. The suc- 
cessful combination of an emphasis on ideas with a sensitivity to power could 
bring about a Gramscian synthesis which still eludes most work on the
transition. Rueschemeyer et al (1992) suggest one strategy with their focus on the
social construction of class interests. Another direction for future research may 
be found in studies of what Kahler (1990) calls the intellectual lenses of policy 
makers. Kahler's work as well as that of Drake (1989), Hall (1989), Taylor 
(1988), and Foxley (1983) describes the congruence between intellectual for- 
mulations and political and social interests. Neoliberal policies, for example, 
are not just neutral, technical responses to a given type of crisis; rather, they 
reflect a process of socialization and cognitive development (Conaghan et al 
1990). We also need much more research on the role of international actors 
in the creation of "coherent and uniform sets of ideological and organizational 
standards for national behavior" (Boli-Bennett 1980, p. 87). As Meyer (1980) 
points out, the pressure to support the "correct" policies is strengthened by a 
cultural world system that helps define what is legitimate state action. Fajnzyl- 
ber (1990) and Dore (1990), for example, contrast the Japanese and American 
models of management and industrial policy and suggest that some of the
difference between the development patterns of East Asia and Latin America 
may reflect the different choices of paradigms. 


CONCLUSION 


The creation of a social contract, a dominant but democratic state, and an 
atmosphere of trust and cooperative behavior will not in and of themselves 
guarantee a successful transition. Each depends in part on the presence of the 
others. A state able to guarantee contracts will help foster an atmosphere of 
trust. A shared faith in the future will support the development of a social 
compact. A sense of common identity and an acceptance of a class compromise
will facilitate the creation of a legitimate state authority. Depending on their 
historical legacies, societies in transition may wish to focus on one aspect 
before the others. For example, in the ex-Soviet Union, the solidification of 
political authority is critical, while Latin America needs to formulate a new 
class compromise allowing for a more equal distribution of resources. 

I would suggest that future work on transitions should focus on the interac- 
tion between the three mechanisms and the manner in which it helps link the 
structural legacy of the past with the strategic agency of the present. We might 
also wish to reconsider the unit of analysis for that interaction. One of the most 
striking aspects of the literature on transition is how little we know of what it 
involves on a daily basis. We have quite a large amount of data and analysis 
of the "macro" side of the transition, but relatively little on the "micro." We
speak of "adjustment" as if the societies were psychiatric patients requiring a 
bit of therapy to return to normal. But we know very little about what that 
normality means or about the strategies used by the population in coping with 
the change. We use words such as “market” and “democracy” as if these meant
the same thing in Moscow, Madrid, or Manila. Far too much of our analysis
of the transitions (including this review) also assumes a rationalistic perspec- 
tive that neglects the raw emotional impact of these changes. The collapse of 
a moral order complete with childhood socialization, treasured symbols, ac- 
cepted beliefs, and sentimental associations cannot be treated as a simple 
choice between past and future, better and worse living conditions, or even 
liberty and coercion. How the new regimes manage the balance between 
changing enough of the old order and preserving some of the sacred symbols
may hold the key to the success of a transition. In the final analysis, the success 
of the political and economic revolutions of the late twentieth century will rest 
with what they come to mean to those who must bear the cost of change. 

I suspect that answers to the critical issues of the double transition will not 
be found in computer rooms, library archives, closed-ended surveys, or
strategic games, but will instead require field work. As students of the transition,
we should be prepared to spend extended periods of time simply listening to 
the societies as they undergo this process. A night in a bus station in Peru, a 
day spent in a farmer's market in Russia, a few hours in the high-rise slums
of Malaysia may provide more insights than the finely crafted data of the latest 
IMF report. Such an approach combined with a theoretical framework attuned 
to both domestic and international pressures, economic logic and political 
necessity, social power and cultural construction, may shed the most light on 
the limits and possibilities of our contemporary great transformation. 

Abstract


Social problems may fruitfully be looked at as constructed phenomena, that 
is, what constitutes a problem is the concern that segments of the public feel 
about a given condition. From the constructionist perspective, that concern 
need not bear a close relationship with the concrete harm or damage that the 
condition poses or causes. At times, substantial numbers of the members of 
societies are subject to intense feelings of concern about a given threat which 
a sober assessment of the evidence suggests is either nonexistent or consider- 
ably less than would be expected from the concrete harm posed by the threat. 
Such over-heated periods of intense concern are typically short-lived. In such 
periods, which sociologists refer to as "moral panics," the agents responsible 
for the threat ("folk devils") are stereotyped and classified as deviants. What
accounts for these outbreaks or episodes of moral panics? Three theories have 
been proposed: grassroots, elite-engineered, and interest group theories. Moral 
panics are unlike fads; though both tend to be relatively short-lived, moral 
panics always leave an informal, and often an institutional, legacy. 


This chapter was adapted from Moral Panics by Erich Goode and Nachman Ben-Yehuda,
Oxford, England: Blackwells, 1994. Permission to adapt this material is acknowledged. 

INTRODUCTION


On the continent of Europe, roughly between 1400 and 1650, hundreds of
thousands of people (perhaps as many as half a million, up to 85% of whom
were women) were judged to have “consorted with the devil” and were put
to death. Much of Europe, especially France, Switzerland, and Germany, was 
in turmoil with suspicion, accusations, trials, and the punishment of supposed 
evildoers. A kind of fever—a craze or panic—concerning witchcraft and ac- 
cusations of witchcraft swept over the land. Once an accusation was made, 
there was little the accused could do to protect herself. Children, women, and 
“entire families were sent to the stake.... Entire villages were exterminated.... 
Germany was covered with stakes, where witches were burning alive.” Said 
one inquisitor, “I wish [the witches] had but one body, so that we could burn 
them all at once, in one fire!” (Ben-Yehuda 1985:36, 37). 

In 1893, a charismatic religious mystic led a group of faithful followers into 
a remote mountain valley in the Brazilian state of Bahia and founded the 
community of Canudos; within two years, the settlement became the second 
largest city in the state. Canudos was a millennial cult whose adherents believed
that the existing order would be overturned with the dawning of a new day. 
Landowners, the Catholic Church, and political elites resolved to crush the 
movement. Three military assaults against the settlement were repulsed by 
tenacious defenders. Finally, in October, 1897, Canudos was encircled by 8000 
troops, serving under three generals and Brazil's Minister of War, and was 
bombarded into submission by heavy artillery. Thousands were killed; the 
survivors numbered only in the hundreds. Soldiers smashed the skulls of 
children against trees; the wounded were drawn and quartered, hacked to pieces 
limb by limb. All 5000 houses in the settlement were “smashed, leveled and 
burned.... The army eradicated the remaining traces of the holy city as if it 
had housed the devil incarnate" (Levine 1992:190). Throughout the campaign, 
news of Canudos flooded the Brazilian press; a sense of "public panic" was 
created. Accounts appeared daily, "almost always on the front page." More 
than a dozen major newspapers sent war correspondents to the front and "ran 
daily columns reporting events." "Something about Canudos provoked anxiety, 
which would be soothed only by evidence that Canudos had been destroyed" 
(Levine 1992:24). 

These historical episodes represent explosions of fear and concern at a 
particular time and place about a specific perceived threat. In each case, a 
specific agent was widely felt to be responsible for the threat; in each case, a 
sober assessment of the evidence concerning the nature of the supposed threat 
forces the observer to the conclusion that the fear and concern were, in all 
likelihood, exaggerated or misplaced. Sociologists refer to such episodes as 
moral panics. They arise as a consequence of specific social forces and
dynamics. They arise because, as with all sociological phenomena, threats are
culturally and politically constructed, a product of the human imagination. 


Social Problems: Objectivism or Constructionism? 


Social problems can be approached from either an objectivist or a construc- 
tionist perspective. Objectivists argue that what defines a social problem is the 
existence of an objective, concretely real, damaging or threatening condition. 
What makes a condition a problem is that it harms or endangers human life 
and well-being. Any condition that causes death or disease, that shortens life 
expectancy or deteriorates the quality of life on a large scale, must be defined 
as a social problem (Manis 1974, 1976). The functionalist paradigm is a variant 
of the objectivist model in that it sees social problems largely as a product of 
dysfunctions, social disorganization, role and value conflicts, and a violation 
of norms—that is, a discrepancy between what is and what ought to be (Merton 
& Nisbet 1976). Likewise, the traditional Marxist position accepts the notion 
that social problems should be defined objectively—by the harm that is in- 
flicted on large numbers of people as a result of injustices such as exploitation, 
oppression, racism, sexism, and imperialism (Liazos 1982). 

On the other side of the debate, representing the constructionist, “subjectiv- 
ist," or “relativist” position, it is argued that what makes a given condition a 
problem is the "collective definition" of that condition as a problem, that is, 
the degree of felt concern over a given condition or issue. To the
constructionist, social problems do not exist objectively; they are constructed by the
human mind, called into being or constituted by the definitional process (Spec- 
tor & Kitsuse 1977, Schneider 1985, Best 1989, Holstein & Miller 1993, Miller 
& Holstein 1993). The objective existence of a harmful condition does not, by 
itself or in and of itself, constitute a social problem. Indeed, to the construc- 
tionist, a given condition need not even exist to be defined as a social problem 
(Becker 1966:5)—witness the persecution of witches in Renaissance Europe 
and colonial New England (Erikson 1966, Ben-Yehuda 1980, 1985). In short, 
a social problem is “the activities of individuals or groups making assertions 
of grievances and claims with respect to some putative conditions" (Spector 
& Kitsuse 1977:75). Definitions of social problems derive from or are produced 
by specific sociocultural circumstances, groups and categories, social struc- 
tures and societies, historical eras, individuals, and/or classes. 

To the constructionist, it is not necessary for individuals to formulate or use 
the precise phrase, "X is a social problem in this society." The problemhood 
of conditions may be expressed in a variety of ways—including expressed 
attitudes, activism, voting on issues, participation in social movements, rebel- 
lion, consuming media stories about certain issues, and so on. More concretely, 
to the constructionist, the reality of social problems can be measured or
manifested in some of the following ways: (i) organized, collective action or
campaigns on the part of some of the members of a society to do something 
about, call attention to, protest, or change (or prevent change in) a given 
condition—in short, “social problems as social movements" (Mauss 1975, Best 
1990:2-3); (ii) the introduction of bills in legislatures to criminalize, or
otherwise deal with, the behavior and the individuals supposedly causing the
condition (Becker 1963:135ff, Gusfield 1963, 1981, Best 1990:2-3); (iii) the
ranking of a condition or an issue in the public's hierarchy of the most serious 
problems facing the country (Best 1990:2-3, 151-75, Goode 1993:49-50); and
(iv) public discussion of an issue in the media in the form of magazine and 
newspaper articles and television news stories, commentaries, documentaries, 
and dramas (Becker 1963:141-43, Best 1990:2-3, 87-111).

The strict or "hard" constructionist focuses exclusively on the nature of 
claims-making and considers all “objectivistic” accounts, including those of 
the scientist and technician, as simply yet another claims-making activity 
(Aronson 1984, Kitsuse & Schneider 1989). In contrast, to the moderate or 
contextual constructionists (Best 1989), one of the more intriguing features of 
social problems is the fact that extremely harmful conditions may not be 
regarded as serious social problems, while relatively benign ones are. The 
answer as to why people become concerned about certain conditions cannot 
be found, at least not entirely, in the realm of objective damage. It is the 
discrepancy between concern and the concrete threat posed by or damage 
caused by a given condition that forces us to raise the question, why the concern 
over one issue but not another? Or, why concern now but not previously? If 
we insist that we have no right to determine the nature of the threat posed by 
certain conditions, such questions are not problematic—indeed, they are not 
even possible. 

If public concern is a logical, almost inevitable, product of impending or 
concrete, inflicted harm, then that concern is not problematic, not a phenom- 
enon necessitating an explanation. If, on the other hand, objective harm and 
public concern vary in large measure independently of one another, then this 
concern demands an explanation. How do definitions of social problems come 
about? Why is a social problem "discovered" in one period rather than another? 
What steps are taken, and by whom, to remedy a given condition? Why do 
segments of the society take steps to remedy this condition but not that, even 
more harmful, one? Who wins, and who loses, if a given condition is recog- 
nized as a social problem? How do groups, classes, or segments of the society 
struggle to establish their own definition of social problems? How is pro- 
blemhood established? And by whom? Constructionism forces us to see the 
social dynamics behind the creation of conditions as problems. And the moral 
crusade is one crucial avenue through which certain conditions are constructed 
and validated as social problems. 

In 1930, only 16 states in the United States had laws on the books which made
marijuana possession and sale a crime; even in the criminalized states, there 
was "relatively lax" enforcement of the laws; and few Americans smoked 
marijuana or knew anyone who did. The dominant attitude toward marijuana 
use was apathy and indifference (Becker 1963:137, 135). But by 1937, every 
one of the then-48 states had passed a law outlawing marijuana possession, 
and during that year, a federal law, the Marihuana Tax Act, was passed. For 
the first time in history, Americans came to be arrested for marijuana posses- 
sion and sale in large numbers. During the 1930s, scores of sensationalistic 
magazine articles, and thousands of local newspaper articles, were published 
on the horrors of marijuana use, dubbing the drug the "killer weed," the “weed 
of madness," a "sex-crazing drug menace," the "burning weed of hell," a 
"gloomy monster of destruction" (Goode 1993:194, 1994:179).

Why this dramatic shift? How was it that this rarely used and relatively 
innocuous drug attracted legislative, police, public, and media attention?
Howard Becker (1963:135-46) argued that the passage of the marijuana laws, and
the attendant media and public attention, was a product of the efforts of key 
moral entrepreneurs, specifically officials in the Federal Bureau of Narcotics 
(FBN), who "perceived an area of wrongdoing that properly belonged in their 
jurisdiction and moved to put it there" (Becker 1963:138). The Bureau's thrust 
was made up of two prongs. First, FBN officials worked with state legis- 
latures—including drafting model legislation—to facilitate the passage of the 
state anti-marijuana laws. And second, they provided "facts and figures" to 
the media which formed the basis for articles in national magazines. Thus, 
"through the press and other communications media," the Bureau sought to 
generate “a favorable public attitude toward the proposed" law (Becker 1963: 
139). The “national menace" or threat posed by marijuana use did not have 
an objective reality. The FBN created a crisis where no basis for it existed, 
and the campaign created a “new class of outsiders—marihuana users" (Becker 
1963:145). 

In the campaign to criminalize the sale and possession of marijuana, a 
specific form of behavior was defined as a major problem by officials, moral 
crusaders, the press, and/or segments of the public. And a specific form of 
wrongdoing—deviant behavior—was seen to be responsible for the problem 
spelled out by the campaign. Specific individuals—marijuana users and deal- 
ers—were targeted as deviants. And the furor of intense concern in some 
quarters that emerged over the issue subsided after a fairly brief period of time. 
Official, media, and public concern about marijuana faded by the early 1940s 
and remained at a low ebb throughout the 1950s and into the early 1960s. 

The moral crusades concept is powerful; it permits the sociologist a
penetrating look at crucial events and developments of importance. But, as with all
sociological concepts, it is limited; it fails to capture some central features of 
analogous phenomena. Do all moral crusades capture the fervor and concern 
of substantial segments of the public? If not, may we still refer to them as 
moral crusades? Are all phenomena that capture this fervor the product of an 
organized group of key powerful agents and actors? If not, how are we to refer 
to such a phenomenon? The notion of a crusade requires crusaders, or what 
Becker (1963:147ff) refers to as “moral entrepreneurs,” that is, organizers, 
do-gooders, movement activists who push for a given cause. To put things 
another way, the moral entrepreneur creates the crusade: no entrepreneur, no 
crusade. By definition, the latter is a product of the former. But what if we 
cannot locate these agents? Clearly, the engineered, consciously created quality 
of the widespread public concern is an empirical, not a definitional, question. 
If we have posited widespread concern, in principle that concern could have 
been triggered by a variety of sources. It cannot, by definition, be a conscious 
creation of activists. Can a moral crusade grow out of a latent or expressed 
public concern rather than, automatically, the other way around? By the
1960s, it became clear that the moral crusades concept had to be extended and 
built upon. 


Enter the Moral Panics Concept 


While certain institutions and behaviors may exist before they are conceptu- 
alized and named, these processes permit them to be observed and analyzed. 
Seemingly irrational mass behavior was certainly noticed in the collective 
behavior literature as far back as Charles Mackay and Gustave LeBon, but the 
concept of moral panics as an analytically distinct rather than analogous social 
form made its appearance only in the 1960s. 

On a cold, wet Easter Sunday, 1964, in Clacton, a seaside resort community 
on England's eastern coast, what would normally be regarded as a minor
disturbance among young people broke out on the street. Several scuffles and 
brief rock-throwing incidents took place; motorbikes and motorscooters roared 
up and down the street; some windows in a dance hall were smashed; several 
beach huts were damaged; a starter's pistol was fired in the air. The police, 
unaccustomed to such rowdiness, arrested nearly 100 youths on charges rang- 
ing from "abusive behavior" to resisting arrest. 

While not exactly raw material for a major story on youth violence, the 
seaside disturbances nonetheless touched off what can only be described as an 
orgy of sensationalism in the British media. On Monday, the day following 
these events, every national newspaper (with the exception of the staid London 
Times) ran a lead story on the Clacton disturbances. “Day of Terror by Scooter 
Groups," screamed the Daily Telegraph; "Youngsters Beat Up Town" claimed
the Daily Express; the Daily Mirror chimed in, "Wild Ones Invade Seaside."
On Tuesday, press coverage was much the same. Editorials on the subject of 
youth violence began to appear. The Home Secretary was urged to take firm 
action to deal with the problem. Articles began to appear featuring interviews 
with Mods and Rockers, the two youth factions current in Britain at the time, 
who were involved in the scuffles and the vandalism. Theories were articulated 
in the media, attempting to explain what was referred to as mob violence. 
Accounts of police and court actions were reported; local residents were 
interviewed concerning the subject, their views widely publicized. The story 
was deemed so important that much of the press around the world covered the 
incidents. Youth fights and vandalism at resorts continued to be a major theme 
in the British press for some three years. Each time a disturbance broke out, 
the same exaggerated, sensationalistic stories were repeated. 

The over-heated reaction of the police, the media, the public, politicians, 
and, in time, action groups and proto-social movement organizations, caught 
the attention of Stanley Cohen. To Cohen, the major issue was the “funda- 
mentally inappropriate" reaction by key social actors in key sectors of the 
society to relatively minor events. The press, especially, had created a horror
story practically out of whole cloth. The seriousness of events was exaggerated
and distorted in terms of the number of young people involved, the
nature of the violence committed, the amount of damage inflicted, and their 
impact on the community and the society as a whole. Obviously false stories 
were repeated as true; unconfirmed rumors were taken as fresh evidence of 
further atrocities. Once the atrocities were believed to have taken place, a 
process of sensitization was set in motion, whereby extremely minor distur- 
bances became the focus of press and police attention, captured in the headline 
at the time: "Seaside Resorts Prepare for the Hooligans' Invasion." And often, 
Cohen argued, the sensitization process generated an escalation in the distur- 
bances; a minor incident became a more substantial one through overzealous 
enforcement. 

Cohen launched the term moral panic to characterize the reactions of the 
media, the police, the public, politicians, and action groups to the youthful 
disturbances. Said Cohen, in a moral panic: 


A condition, episode, person or group of persons emerges to become defined as 
a threat to societal values and interests; its nature is presented in a stylized and 
stereotypical fashion by the mass media; the moral barricades are manned by 
editors, bishops, politicians and other right-thinking people; socially accredited 
experts pronounce their diagnoses and solutions; ways of coping are evolved or ... 
resorted to; the condition then disappears, submerges or deteriorates and becomes 
more visible. Sometimes the subject of the panic is quite novel and at other times 
it is something which has been in existence long enough, but suddenly appears 
in the limelight. Sometimes the panic passes over and is forgotten, except in 
folklore and collective memory; at other times it has more serious and long-lasting 
repercussions and might produce such changes as those in legal and social policy 
or even in the way society conceives itself (Cohen 1972:9). 

In a moral panic, the reactions of the media, law enforcement, politicians, 
action groups, and the general public are out of proportion to the real and 
present danger a given threat poses to the society. In response to this exagger- 
ated concern, "folk devils" are created, deviant stereotypes identifying the 
enemy, the source of the threat, selfish, evil wrongdoers who are responsible 
for the trouble. The fear and heightened concern are exaggerated, that is, are 
above and beyond what a sober empirical assessment of the threat's concrete danger 
would sustain. Thus, they are problematic, a phenomenon in need of an ex- 
planation; they are caused by certain social and political conditions that must 
be identified, understood, and explicated. 

To Cohen, in a moral panic, sensitization occurs. Sensitization is the process 
whereby harm, wrongness, or deviance is attributed to the behavior, condition, 
or phenomenon that is routinely ignored when the same consequences are caused 
by or attributed to more conventional conditions. The “cue effect” of the 
condition or behavior is much greater during the moral panic; supposed effects 
are noticed and linked to the offending agents that, in ordinary times for ordinary 
behavior, would have disappeared in the routines and hubbub of everyday life. In 
addition, the police “escalate” their law enforcement efforts, “diffuse” them from 
precinct to precinct, and “innovate” new methods of social control (1972:86-91); 
they operate under the “widening-the-net” principle (1972:94). 

If all social fears and concerns entailed reactions to a specific, clearly 
identifiable, and appropriate or commensurate threat, the magnitude of which 
can be objectively assessed and readily agreed upon, such reactions would 
require no explanation. On the other hand, if, as Cohen argues, the reaction is out 
of proportion to the threat, we are led to ask why it arises. Why is there a moral 
panic over this supposed threat but not that, potentially even more damaging, 
one? Why does this cast of characters become incensed by the threat the behavior 
supposedly poses, but not that cast of characters? Why a moral panic at this time, 
but not earlier and not later? What role do interests play in the moral panic? What 
does the moral panic tell us about how society is constituted, how it works, how 
it changes over time? Cohen’s concept introduces the student of society to a wide 
range of questions and potential explorations. 


Moral Panics: Definition and Criteria 


What characterizes a moral panic? How do we know when a moral panic has 
taken hold in a society at a specific time? How may we operationalize the 
concept? The moral panic is defined by at least five crucial elements or criteria. 


CONCERN First, as we saw, there must be a heightened level of concern over 
the behavior (or supposed behavior) of a certain group or category and the 
consequences that this behavior presumably causes for the rest of the society. 
As with social problems, this concern is manifested or measurable in concrete 
ways, through, for example, public opinion polls, media attention, proposed 
legislation, action groups, or social movement activity. 


HOSTILITY Second, there must be an increased level of hostility toward the 
category of people seen as engaging in the threatening behavior. Members are 
collectively designated as the enemy of respectable, law-abiding society; their 
behavior is seen as harmful or threatening to the values, interests, way of life, 
possibly the very existence, of the society, or a sizeable segment of that society. 
These deviants are seen as responsible for the threat. A dichotomization between 
“them” and “us” takes place, and this includes stereotyping—generating 
“folk devils” or villains on the one hand, and folk heroes on the other, in this 
morality play of evil versus good (Cohen 1972:11-12). 


CONSENSUS Third, there must be a certain minimal measure of agreement in 
the society as a whole or in designated segments of the society that the threat 
is real, serious, and caused by the wrongdoing of group members and their 
behavior. This sentiment must be fairly widespread, although the proportion 
of the population who feels this way need not even make up a majority. 
Differently put: Moral panics come in different sizes—some gripping only 
certain social categories, groups, or segments, others causing great concern in 
the majority. Some discussions (for instance, Zatz 1987) do not posit wide- 
spread public concern as an essential defining element of the moral panic, 
while others (Hall et al 1978) make the assumption that public concern is little 
more than an expression or epiphenomenon of elite interests. It is necessary 
to remind ourselves, however, that many elite-generated campaigns do not 
capture the public imagination, and never become widespread moral panics— 
witness the "family values" theme on which Republican candidate George 
Bush initially based his campaign in the 1992 American presidential race, 
which had little resonance for most voters. In addition, the general public, or 
segments of the public, have interests of their own, and often become intensely 
concerned with threats that elites would prefer be ignored, such as nuclear 
contamination (Perrow 1984:324-28, Erikson 1990) and satanism (Richardson 
et al 1991, Jenkins & Maier-Katkin 1992, Victor 1993). Public concern cannot 
be swept under the rug as an irrelevant criterion. Still, in arguing that a measure 
of consensus is necessary to define a moral panic, we do not mean to imply 
that panic seizes everyone, or even a majority of the members of a society at 
a given time. Even during moral panics, public definitions are fought over, 
and some of them win out among one or another sector of the society, while 
others do not. 


DISPROPORTIONALITY Fourth, there is the implicit assumption in the use of 
the term moral panic that the concern is out of proportion to the nature of the 
threat, that it is, in fact, considerably greater than that which a sober empirical 
evaluation could support; in the moral panic, “objective molehills have been 
made into subjective mountains” (Jones et al 1989:4). In moral panics, generating 
and disseminating numbers is important (Best 1990:45-64)—addicts, 
deaths, dollars, crimes, victims, injuries, illnesses, total cost—and most of the 
figures cited by moral panic claims makers are wildly exaggerated. 

The criterion of disproportionality is not without its critics (Waddington 
1986). How do we know that the attention accorded a given issue or phenom- 
enon is disproportional to the concrete or objective threat it poses? Here are 
four indicators or criteria. 

First, if the figures that are cited to measure the scope of the problem are 
grossly exaggerated, we may say that the criterion of disproportionality has 
been met (Ben-Yehuda 1986, 1990:97-134, Best 1990:45-64). Second, if the 
threat that is feared is, by all available evidence, nonexistent, we may say that 
the criterion of disproportionality has been met (Richardson et al 1991, Jenkins 
& Maier-Katkin 1992, Victor 1993). Third, if the attention paid to a specific 
condition is vastly greater than that paid to another, and the threat or damage 
caused by the first is no greater than, or is less than, the second, the criterion 
of disproportionality may be said to have been met (Goode 1993:42-57). And 
fourth, if the attention paid to a given condition at one point in time is vastly 
greater than that paid to it during a previous or later time without any corre- 
sponding increase in objective seriousness, then, once again, the criterion of 
disproportionality may be said to have been met (Ben-Yehuda 1986, 1990:97-134, 
Goode 1993:48-53). 


VOLATILITY And fifth, moral panics are volatile: They erupt fairly suddenly 
(although they may lie latent for long periods of time and may reappear from 
time to time), and, nearly as suddenly, they subside. As we'll see, some moral 
panics may become routinized or institutionalized, while other moral panics 
vanish—seemingly—without so much as a trace; the legal, cultural, moral, 
and social fabric of the society after the panic is essentially no different from 
the way it was before. But whether it has a long-term impact or not, the degree 
of fear, hostility, and concern generated during a moral panic tends to be fairly 
limited temporally; the fever pitch that characterizes a society or segments of 
it during the course of the moral panic is not sustainable over a long stretch 
of time. 

To describe moral panics as volatile and relatively short-lived does not imply 
that they do not have structural or historical antecedents. The specific issue 
that generates a particular moral panic may have done so in the past, perhaps 
even in the not-so-distant past. In fact, moral panics that are sustained over 
long periods of time are almost certainly conceptual groupings of a series of 
more or less discrete, more or less localized, more or less short-term panics. 
Likewise, describing a given concern as volatile does not mean that moral 
panics do not, or cannot, leave a cultural and institutional legacy. Indeed, 
elements of panics may become institutionalized; during panics, organizations 
and institutions may be established at one point in time that remain in place 
and help stimulate incipient concerns later on, at the appropriate time. 


Three Theories of Moral Panics 


Why moral panics? Why do the public, the media, the police, politicians, and/or 
social action groups in a particular society at a particular time evidence intense 
concern about a condition, phenomenon, issue, or behavior that, a sober as- 
sessment of the empirical evidence reveals, does not merit such a level of 
concern? Two dimensions distinguish the theories that have been advanced to 
explain moral panics. 

The first is the morality vs interests dimension, and the second is the elitism 
vs grassroots dimension. The first dimension addresses the question of motive: 
Do concern and activism coalesce around a given issue because of world-view, 
ideology, and morality—that is, deeply and genuinely felt attitudes and senti- 
ments—or because actors stand to gain something of value—jobs, power, 
resources, respectability, wealth, recognition, ownership of a domain of ex- 
pertise—if others become concerned about that issue? And second, are many 
actors responsible for the creation and maintenance of the panic, or few? Does 
the panic start from the bottom and progress up, or does it work from the top 
down? Or does a panic begin in the middle of society's status, power, and 
wealth hierarchy, neither from the elite at the top nor from the undifferentiated 
general public, but from representatives or leaders of specific middle-level 
organizations, agencies, groups, institutions, or associations? 

When we combine these two dimensions—motives based on morality/ide- 
ology vs status/economic interests and levels of power, status, and wealth— 
theoretically, we have six possible theories, as Table 1 shows us: 


Table 1 Theories of moral panics: motives and origin 


Table 1 is a very rough theoretical delineation of the possible explanations 
or theories of moral panics. Not all these possibilities are equally likely to have 
attracted proponents. Cell 1, where elites generate a moral panic out of deep 
feelings of ideology and morality, independent of their status and material 
interests, we suspect, may be regarded as a null or empty cell, essentially 
empirically nonexistent. To our knowledge, it has attracted no proponents; this 
may be because, in Western society, profoundly influenced as we are by 
Marxist thinking, it is difficult for us to imagine elite ideology divorced from 
elite interests. For all practical purposes, when Western intellectuals think of 
elite ideology, it is part and parcel of elite efforts to justify their self-interests. 
It is possible, however, that real-world examples of Cell 1 actually do exist. 
For instance, in Iran after the revolution of 1979, elites (and the masses) waged 
a war against Western values and behavior for moral, ideological, and religious—not 
material—reasons. In many ways, in fact, this campaign hurt the 
material interests of Iran (and her nonreligious elites). In our view, Cell 1 
should not be dismissed out of hand. 

Cell 2 is, of course, the classic Marxist approach. It would argue that elites 
"engineer" or “orchestrate” moral panics in order to gain some material and/or 
status advantage. According to this model, elites fabricate a panic over a 
nonexistent or trivial threat—and one about which they in fact feel relatively 
little genuine concern—in order to gain something of value or divert attention 
from issues that, if addressed, would threaten their own private interests. Or 
they engineer a panic to reaffirm or maintain their ideological hegemony. To 
the extent that other sectors of the society—such as the media—are necessary 
to the scheme, they are merely "pliant" or "cooperative" in expressing elite 
interests. 

Advocates of arguments falling into Cell 3 argue that occupants of the 
middle status and power levels of the society—the police, professional orga- 
nizations, the media, intellectuals, and so on—act in large measure independent 
of elites to express or maximize their own morality or ideology. Proponents 
of arguments falling into Cell 4 argue that occupants of those same status 
levels primarily seek material or status advantage. As we shall see shortly, 
while these two cells can be separated in principle, in real life it is a bit more 
difficult to distinguish them. The interest group model sees material and/or 
status interests as crucial, but argues that moral panics originate neither from 
the top nor from the bottom, but somewhere in society's middle rungs—pro- 
fessional associations, the police, the media, religious groups, educational 
organizations—middle-level associations, organizations, groups, institutions, 
sectors, and categories of every description. The interest group model does not 
see society as controlled from the top down; interest groups often have interests 
that contradict those of elites, and the former often, perhaps usually, initiate 
crusades, panics, and campaigns in the face of elite opposition or indifference. 
In the interest group model, ideological and moral motives tend to be downplayed, 
ignored, or subsumed, while material and status interests are focused 
on as crucial. 

Cell 5, usually referred to as the "grassroots" model, argues that moral panics 
are generated from the "bottom up," and, concomitantly, that morality and 
ideology are dominant motives for activists and concerned citizens. To the 
grassroots theorist, moral panics are more or less spontaneous eruptions of 
concern on the part of a large number of people about a given threat or putative 
threat. To our knowledge, Cell 6, the argument that the mass of the public 
generates moral panics primarily out of material or status considerations, does 
not have any advocates. 

While a delineation of the hypothetical models resulting from our combi- 
nation of these two dimensions may be of interest for speculative purposes, 
only three are of significance to researchers and theorists—the grassroots 
model, the elite-engineered model, and the interest group model. Let us look 
at them in a bit more detail. 


THE GRASSROOTS MODEL The grassroots model argues that panics originate 
with the general public. The concern about a particular threat is a widespread, 
genuinely felt—if perhaps mistaken or at least exaggerated—concern. The 
expressions of concern in other sectors (that is, in the media, among politicians, 
political action groups, and law enforcement) are an expression or a manifes- 
tation of more widespread concern. The action of no special group or sector 
is necessary to generate the public concern that breaks out at a particular time 
about a specific issue. Instead, the concern may arise more or less spontane- 
ously, although, to be manifested in a public manner, it may require being 
catalyzed, assisted, guided, or triggered. Thus, if politicians or the media seem 
to originate or "stir up" concern about a given issue, in reality, the concern 
must have been latent to begin with. Politicians and the media cannot fabricate 
concern where none existed initially; all too often, they have tried to stir up 
concern about a given issue and failed. In the overt expression of the moral 
panic, a kind of diffuse anxiety or strain simply explodes at the appropriate 
time, given the requisite triggering event. The panic is simply the outward 
manifestation of what already existed in less overt form. Politicians give 
speeches and propose laws they already know will appeal to their constituency, 
whose views they have already sounded out; the media broadcast stories their 
representatives know the public, or a segment of the public, is likely to find 
interesting and, about certain topics, troubling. What is central to the grassroots 
theorist—what explains the outbreak or the existence of the moral panic—is 
deeply felt attitudes and beliefs on the part of a broad sector of the society, 
that a given phenomenon represents a real and present threat to their values, 
their safety, or even their very existence. 

Panics often entail populist sentiment which sees a threat as emanating from 
powerful, high-status strata. Their very power and status impart an ominous, 
dreadful quality to their capacity to harm common, honest, hard-working folk. 
On many issues, the rank-and-file members of Western society mistrust the 
rich and the powerful, and harbor suspicions that they are likely to engage in 
evil deeds that threaten the rest of us with harm in order to line their own 
pockets. The rumor, said to be rife in African-American communities, that the 
American corporate elite—all rich, powerful, and white—are supplying Black 
communities with drugs in order to commit genocide against them (Turner 
1993), is an apt example of this principle. 

In the British moral panics that developed in the 1970s and 1980s about 
threatened women and children, both liberals and conservatives felt it was 
possible that "the crimes of the upper classes would be concealed by a cover-up 
by an ubiquitous 'old-boy network'" (Jenkins 1992:77). The revelations during 
the same period that a number of individuals with elite backgrounds and 
positions and Oxford-Cambridge educations were Soviet spies—and homosexuals—generated 
substantial hostile populist sentiment. These scandals "helped 
condition public attitudes to the image of upper-class perverts whose activities 
were concealed by their colleagues .... Politicians were repeatedly involved 
in scandals involving homosexuality, pedophilia, or frequenting ... male prostitutes" 
(Jenkins 1992:78). This sentiment is crystallized in the words of one 
police officer who, during a particular scandal involving underage male prostitutes, 
said: "people always want their paedophiles to be judges or politicians" 
(Jenkins 1992:78). 

While not a moral panic according to the criteria of volatility and the evil 
doings of "folk devils," the fear of nuclear contamination qualifies as a moral 
panic, or so say some experts (Slovic et al 1991:7, 11), according to one specific 
criterion—the disjunction between the likelihood of harm (which is supposedly 
extremely small) and public fear of and concern with that threat (which is 
enormous). Extreme and, supposedly, exaggerated fear and concern have made 
it impossible for the nuclear industry to build new power plants in the United 
States for the past decade, and will continue to do so for the foreseeable future. Clearly, this 
fear is not generated by elite interests—taken as a whole, the business elite 
favors nuclear energy. This concern is grassroots generated and sustained. 
Perrow (1984:324-28) and Erikson (1990) trace this panic—this huge and 
almost unique discrepancy between what the experts say and how the public 
feels—to a factor they refer to as dread. It turns out that the experts and the 
public calculate the risk of harm in drastically different ways. Experts make 
use of what Perrow refers to as "absolute rationality"; in contrast, the public 
uses "social rationality" (1984:325). The experts' calculation is made in a 
straightforward, rationalistic fashion: What is the statistical likelihood that 
certain harmful outcomes will occur, and how harmful are these outcomes? 
But to the public, factors other than and in addition to statistical likelihoods 
and degrees of harm enter into the picture. The public fears threats less 
according to their statistical likelihood than to the extent that they are seen as 
involuntary, uncontrollable, unknowable, unfamiliar, catastrophic, certain to 
be fatal, and delayed in their manifestation. 

Nuclear power stands at the extreme end of all these undesirable dimensions 
of fear. Its risks are seen as “involuntary, delayed, unknown, uncontrollable, 
unfamiliar, catastrophic, dreaded and fatal” (Perrow 1984:325). Aspects of risk 
not judged to be crucial in the experts’ evaluations loom extremely large in 
the public’s assessment. Radiation and other technological toxins “contaminate 
rather than merely damage; they pollute, befoul and taint rather than just create 
wreckage; they penetrate human tissue indirectly rather than wound the surface 
by assaults of a more straightforward kind” (Erikson 1990:120). Instead of 
assessing danger by calculating odds in the way experts do, perhaps we should, 
Erikson argues, “understand radioactive and other toxic substances as naturally 
loathsome, inherently insidious horrors ... that draw on something deeper in 
the human mind.” Toxic emergencies “really are different” from more routine, 
mechanical accidents, such as car crashes; “their capacity to induce a lasting 
sense of dread is a unique—and legitimate—property” (1990:121). Fear of 
radiation is not engineered by elites or a product of the activism of members 
of interest groups. It is not “some exotic form of hysteria that will subside 
once the media stops fanning the flames and calm returns to public discourse” 
(1990:125). It stems from deep, primordial feelings that may be impervious to 
machinations from above. It is a genuinely grassroots phenomenon. 

It is quite likely that drug abuse possesses some qualities similar to those 
of nuclear contamination: The control that drugs wield over users is seen as 
insidious, one that creeps into the user’s mind and body and “takes over”; the 
drug is attributed a kind of diabolical or “black magic” power that overwhelms 
and dominates the user. It exerts a power that manifests parallels with other 
dreadful agents that embody “stealth and treachery,” that “invade” the user, 
that “contaminate, pollute, corrupt, taint, and befoul,” that “penetrate human 
tissue” and “never end,” never quite seem to go away. The public finds 
widespread drug abuse riveting and terrifying in part because of these genuine 
primal fears. However misplaced they are, much of the public dreads a foreign 
substance entering the body, causing a silent genetic catastrophe among un- 
countable generations (in the case, supposedly, of LSD), or rendering the user 
an uncontrollable addict (in the case, supposedly, of heroin and crack cocaine). 

Certainly no advocate of the grassroots model would argue that, by itself, 
public sentiment is sufficient to generate a moral panic. In every case, this 
sentiment necessitates a vehicle to elevate a latent fear or concern into wide- 
spread, mutual awareness; the feelings of scattered, isolated individuals must 
be given an appropriate expression. This vehicle may take the form of the mass 
media, political speeches, action groups, and so on. Still, to the grassroots 
theorist, the organizational vehicle that translates fears and concerns into the 
public arena is not the primary explanatory variable. Fears and concerns cannot 
be foisted off on an indifferent public; whatever the vehicle that expresses them, 
moral panics cannot be launched without widespread public support. 


THE ELITE-ENGINEERED MODEL The theory that moral panics are elite engi- 
neered argues that a small and powerful group or set of groups deliberately 
and consciously undertakes a campaign to generate and sustain fear, concern, 
and panic on the part of the public over an issue they recognize not to be 
terribly harmful to the society as a whole. Typically, this campaign is intended 
to divert attention away from the real problems in the society, whose genuine 
solution would threaten or undermine the interests of the elite (Reinarman & 
Levine 1989). Such a theory is based on the view that elites have immense 
power over the other members of the society—they dominate the media, 
determine the content of legislation and the direction of law enforcement, and 
control much of the resources on which action groups and social movements 
depend. 

Hall et al (1978) have advanced what is probably the best-known and most 
detailed analysis illustrating the elite-engineered model of moral panics. They 
argue that fear of mugging in the early 1970s in Great Britain qualifies as a 
moral panic, since fear and concern over street crime increased at a time when 
its actual incidence was not rising. (For a contrary view, see Waddington 1986.) 
Reactions by the courts to mugging offenses, by the media to muggings and 
the sentencing of their offenders, and by the public to news of muggings, were 
“all out of proportion to any level of actual threat” posed by these muggings 
(Hall et al 1978:29). Why did British society react to mugging in the way it 
did? Why the public outrage, the extremely harsh sentences in the courts, the 
mobilization of the police against real and supposedly potential muggers, the 
law-and-order solutions offered by experts and commentators, the harsh glare 
of the media? Why the moral panic over mugging? What was the panic really 
about? What did mugging "represent" to the society? 

Hall et al (1978) argue that the themes of the panic "function as a mechanism 
for the construction" of a definition of social and political reality that serves 
the interests of the powerful. The early 1970s panic over street crime served 
to legitimate a law-and-order criminal enforcement program and to divert 
attention from the growing economic recession, which was causing a "crisis" 
in British capitalism. During such crises, Hall et al argue, the capitalist state 
is forced to shed its facade of neutrality and independence from special interests 
(Hall et al 1978:217) and assume "total social authority ... over the subordinate 
classes" in such a way that "it shapes the whole direction of social life in its 
image" (1978:216). Domination not only seems to be "universal" and "legitimate," 
but exploitation must seem to disappear from view (1978:216). Such 
is an "exceptional moment" in capitalist society, the coming of "iron times," 
where "an authoritarian consensus" is called for (1978:217). When capitalism 
is in crisis, extraordinary times are called for; during such times, a moral panic 
may be hoked up to divert attention away from that crisis. In Britain in the 
1970s, mugging was just the issue to generate a moral panic and come to the 
rescue of an ailing capitalist society. 

At more than a half-dozen points, Hall et al (for instance, on 57, 59, 136, 
176, 182, 322) insist that they are not presenting a “conspiratorial” interpre- 
tation of these events. Rather than being part of a conspiracy, Hall et al argue, 
the ruling elite “orchestrates hegemony,” that is, manages to convince the rest 
of the society—the press, the general public, the courts, law enforcement—that 
the real enemy is not the crisis in capitalism, nor capitalism itself, but the 
criminal and the lax way things have been dealt with in the past. The media 
come “to reproduce the definitions of the powerful” (1978:57) by over-accessing 
individuals “in powerful and privileged positions” (1978:58). The media 
"appropriate" public opinion, thereby having it "structured by... dominant 
ideology" (1978:136). Likewise, the legislature "performs its work on behalf 
of the capitalist system" (1978:208), the law being "an instrument of class 
domination" (1978:196). In short, the moral panic over mugging in Britain 
circa 1970-1973 was engineered by the elite, by the capitalist class, with the 
more or less unwitting complicity of its allies, the media, the legislature, the 
police, and the courts. This fear did not originate with the masses—indeed, it 
is contrary to the interests of the masses—nor with representatives of specific 
middle-level interest groups. And morality or ideology are not what the panic 
is primarily "about." More specifically, ideology is a mechanism in the service 
of elite hegemony, that is, it is a means by which the powerful protect their 
interests, primarily economic interests, and maintain their rule and the stability 
of the capitalist system. Hall et al's analysis of the British moral panic over 
mugging is a classic example of the elite-engineered theory. 


INTEREST GROUP THEORY By far, the most widely used perspective on moral 
panics has been the interest group approach. As we saw, Howard Becker 
(1963:147-63) argued that rule creators and moral entrepreneurs launch moral 
crusades—which sometimes turn into panics—to make sure that certain rules 
take hold and are enforced. In the interest group perspective, as we saw, 
professional associations, police departments, the media, religious groups, 
educational organizations, and so on, may have a stake in bringing to the fore 
an issue which is independent of the interests of the elite. By stating that interest 
groups have an independent role in generating and sustaining moral panics, 
we are saying that they are, themselves, active movers and shakers—that elites 
do not dictate the content, direction, or timing of panics. 

The central question asked by the interest group approach is: cui bono, or 
for whose benefit? Who profits? Who wins out if a given issue is recognized 
as threatening to the society? To whose advantage is the outbreak of a 
widespread panic about a given issue, behavior, or phenomenon? Who stands 
to gain? 

Material and ideological/moral gains have often been separated; presum- 
ably, they represent two entirely separate motivations. Interest group politics 
are often thought of as cynical, self-serving, devoid of sincere belief. In real 
life, such a separation is not always easy to make. Interest group activists may 
sincerely believe that their efforts will advance a noble cause. (And one, it 
might be added, that will also help advance their status, power, and material 
resources.) Advancing a moral and ideological cause almost inevitably entails 
advancing the status and material interests of the group who believes in it; 
likewise, advancing the status and material interests of a group may simulta- 
neously advance its morality and ideology. 

Jenkins shows that the satanic ritual abuse panic that flared up in Britain in 
the late 1980s: 


offered ideological confirmation of the limitations of liberal theology. Since the 
1960s, the dominant factions in British churches have emphasized social and 
political activism with a left/liberal slant.... For Evangelicals and Charismatics, 
this was a lethal distraction from the crucial issues of personal holiness and 
spiritual warfare. During the 1980s, the point was reasserted by a new focus on 
black magic cults, ancestral demons, and ritual abusers (Jenkins 1992:204). 


Not surprisingly, fundamentalist, evangelical, and charismatic Christian orga- 
nizations have been at the forefront of the British (and American) panic about 
satanic ritual abuse from its very inception. In this case, religious conservatives 
are arguing for a certain definition of reality—that satanism is alive and well 
in modern society and working its evil to corrupt righteousness—which ad- 
vances both their material and ideological/moral interests. 

We are arguing that the separation between interests and morality, so easily 
done in theory, is often difficult to make in practice. Perhaps, rather than 
picturing these two motives as contradictory, it is more fruitful to see both as 
operative, but, in a given moral panic, one as more influential or dominant 
than the other. Cynicism and idealism come in degrees; some activists may 
display a mix of motives, while others are more or less entirely self-serving 
or, possibly, entirely idealistic. Enough cynics cloak their self-interested mo- 
tives in pious proclamations for us to be suspicious about the purity of their 
actions. At the same time, we need not be automatically suspicious about the 
motives of actors who, while advancing an ideological or moral agenda, also 
advance their own group's material or status interests. After all, the two are 
usually found in the same package. 

Can we draw any conclusions about the origins of moral panics aside from 
the trite, unsatisfying, and almost tautologically true platitude that different 
theories apply best to different moral panics or different aspects of a given 
moral panic? Almost certainly some latent fear or stress must pre-exist in the 
general public, or segments of the public, for a widespread panic to "take off." 
Concern over a nonexistent or relatively trivial threat cannot be fabricated out 
of whole cloth by a cynical elite or by self-serving representatives of one or 
another interest group. Advocates of the elite engineered model argue that 
elites can read or package or construct issues in such a way that the public 
will find something fearful in them, but this model assumes an extremely 
gullible public. Unlike the criminal law, which need not express the ideology 
or views of major segments of the public (see Hagan 1980), moral panics do 
necessitate the concerns of segments of the society. It is almost inconceivable 
that certain concerns could be foisted off on the public, that the public becomes 
intensely concerned about a supposed threat that, in the absence of these 
machinations, they would otherwise ignore. The grassroots approach, there- 
fore, can be thought of as calling attention to a dimension or factor—rather 
than offering a competing or independent explanation—that plays a central 
role in the moral panic. 

At the same time, the grassroots model must inevitably be supplemented 
with another explanation; it cannot be regarded as complete. While widespread 
stress or latent public fears almost necessarily pre-exist in moral panics, they 
do not explain how and why they find expression at a particular time. These 
fears must be articulated; they must be focused, brought to public attention or 
awareness, given a specific outlet, harnessed to a mechanism of expression. 
And this almost inevitably entails some form of leadership and organization. 
Although large numbers of people may spontaneously feel fear or dread about 
a given agent or threat, to become a moral panic, this fear must be sharpened, 
broadened, articulated, and publicly expressed by organized, movement-like 
activity launched by middle-level interest groups. All activists have to consider 
the question: “What if we threw a party and nobody showed up?” All the 
organizational efforts in the world cannot create public concern where none 
exists to begin with. At the same time, concern needs an appropriate triggering 
device and a vehicle to express itself in a moral panic, and for that, interest 
group formation and activity are central. 

Once again, our argument goes beyond the claim that different models are 
helpful in explaining different moral panics. It is that the grassroots provides 
fuel or raw material for a moral panic, while organizational activists’ issues 
of morality provide the content of moral panic, and interests provide the timing 
(Ben-Yehuda 1986). While the elite engineered model does not seem to work 
for most moral panics, the grassroots model enables us to see what fears and 
concerns are made use of, and the interest group model enables us to see how 
this raw material is mobilized and intensified. By itself, the grassroots model 
is naive; by itself, the interest group model is cynical and empty. Together, 
the two help illuminate the moral panic; interest groups coopt and make use 
of grassroots morality and ideology. No moral panic is complete without an 
examination of all societal levels, from elites to the grassroots, and the full 
spectrum from ideology and morality at one end to crass status and material 
interests at the other. 


Epilogue: Moral Panics, Demise, and Institutionalization 


Although the rise of moral panics has received some attention, their demise 
has been virtually neglected. The question of the demise of moral panics is 
linked intimately with the issue of their impact: What impact do moral panics 
have? Do moral panics promote substantial, long-term social change? Or is 
their impact much like that of fads, which flare up, are popular for a time, and 
vanish without a legacy or, seemingly, a trace? 

The excitement stirred up during a moral panic is strikingly similar to the 
charisma possessed by certain leaders. This excitement, like charisma, is vol- 
atile and unstable. The feelings that are generated during its period of influence 
tend to be intense, passionate. But they do not last. How to ensure that the 
willingness of individuals gripped by this temporary fervor to follow certain 
rules or pursue certain enemies continues over time? How to translate the 
vision stimulated during the moral panic into day-by-day, year-by-year nor- 
mative and institutional policy? How to continue the aims and goals of moral 
entrepreneurs, action and interest groups, leaders, and much of the public, in 
"doing something" about the threat that seems to be posed during the moral 
panic, after the emotional fervor of that panic has died down? What we are 
suggesting is that, as with charismatic leaders, some moral panics are, almost 
unwittingly, particularly successful in routinizing the demands for action that 
are generated during these relatively brief episodes of collective excitement. 

Do moral panics have an impact on the society in which they take place by 
generating formal organizations and institutions; do they, in other words, leave 
an institutional legacy in the form of laws, agencies, groups, movements, and 
so on? If so, what is the nature of that legacy? Do moral panics transform the 
informal normative structure of a society? If so, what is the nature of that 
transformation? 

Some panics seem to leave relatively little institutional legacy. The furor 
generated by the Mods and Rockers in England in the 1960s resulted in no 
long-term institutional legacy; no new laws were passed (although some were 
proposed), and the two germinal social movement organizations that emerged 
in its wake quickly evaporated when the excitement died down. 

In contrast, other panics result in laws and other legislation, social movement 
organizations, action groups, lobbies, normative and behavioral transforma- 
tions, organizations, government agencies, and so on. For example, the periodic 
drug panics that have washed over American society for a century continue 
to deposit institutional sediment in their wake. President Richard Nixon's 
mini-drug panic of the early 1970s hugely expanded the federal drug budget, 
placed the drug war on a firm institutional footing, and created several federal 
agencies empowered to deal with drug abuse in one way or another. The drug 
panic of the mid-to-late 1980s left a substantial institutional legacy in the form 
of two packages of federal legislation, passed in 1986 and 1988, a substantially 
larger federal budget, dozens of private social movement organizations, and 
public sensitization to the drug issue. In this way, not only are successive moral 
panics built on earlier ones, but even in quieter, nonpanic periods, the institu- 
tional legacy that moral panics leave attempts to regulate the behavior that is 
deemed harmful, unacceptable, criminal, or deviant. The earliest, nineteenth 
century, drug panics defined drug abuse as deviant and, eventually, criminal; 
in this sense, they generated social change. The later drug panics, in contrast, 
reaffirmed the deviant and criminal status of drug abuse after a period of drift 
toward normalization, and thus they prevented social change. 

Even seemingly inconsequential panics leave behind some sort of legacy; 
even those that produce no institutional, organizational, or formal legacy are 
likely to have had some impact in the informal or attitudinal realm. With the 
eruption of a given moral panic, the battle lines are redrawn, moral universes 
are reaffirmed, deviants are paraded before upright citizens and denounced, 
and society's boundaries are solidified. In Durkheimian terms, society's col- 
lective conscience has been strengthened. The message of the moral panic is 
clear: This is behavior we will not tolerate. Even seemingly transitory panics 
are not "wasted": They draw more or less precise moral boundaries. Panics 
emphasize the contrast between the condition or behavior that is denounced 
and the correctness of the behavior or position of the righteous folk engaged 
in the denunciation. The satanic ritual abuse scare, for example, reaffirms the 
moral correctness of the fundamentalist Christian way of life. The Mods and 
Rockers scare of 1964 to 1967 prepared the way for a later (early 1970s) moral 
panic in Britain over juvenile delinquency, street crime, and mugging (Hall et 
al 1978). The Canudos massacre reminded Brazilians that they were citizens 
of a modern, progressive, industrializing, and culturally unified nation. 

In short, panics are not like fads, trivial in nature and inconsequential in 
their impact. Even those panics that seem to end without institutional impact 
often leave normative or informal traces that prepare us for later panics or 
other events. Some, for example, leave cultural residue in the form of folklore 
(Best 1990:131-50; Turner 1993). A close examination of the impact of panics 
forces us to take a more long-range view of things, to see panics as a long-term 
social process rather than as separate, discrete, time-bound episodes. Moral 
panics are a crucial element in the fabric of social change. They are not 
marginal, exotic, trivial phenomena, but one key by which we can unlock the 
mysteries of social life. 


Abstract 


Sociologists have always recognized the “double function” of religion in the 
legitimation of power and privilege and in protest and opposition, but theories 
of secularization and modernization predicted the declining significance of 
religion in contemporary public life. We review new approaches to religious 
activism and legitimation efforts in the United States and in the world-system 
that stress the interrelatedness of religion and politics. Then we evaluate the 
contributions of new theories of social movements and culture, Marxism, and 
feminism to new conceptualizations of the relationships among religious strug- 
gles, inequality, and political order today. 


INTRODUCTION 


Sociologists have long understood the significance of the “double function” 
of religion “as apology and legitimation of the status quo and its culture of 
injustice on the one hand, and as a means of protest, change, and liberation 
on the other hand” (Solle 1984:21). In an influential synthesis of Marx, Weber, 
and Durkheim, Berger (1967) described religion as both a “world-maintaining” 
and “world-shaking” force capable of legitimating or challenging power and 
privilege. Despite religion’s historical role in legitimation, however, theories 
of secularization and modernization have stressed the declining significance 
of religion in contemporary public life. Parsons (1964) posited a lingering but 
indirect role for religious legitimation through institutional differentiation and 
value generalization, but Berger (1967) described seemingly inexorable 
processes of secularization and privatization, and Bellah (1970:43) asserted that 
"standards of doctrinal orthodoxy and attempts to enforce moral purity [had] 
largely been dropped" from modern religious life. 

Unexpected events in the late 1970s, however—the rise of the "New Chris- 
tian Right" in the United States and the international resurgence of religious 
activism symbolized by the revolutions in Iran and Nicaragua—have encour- 
aged a paradigm shift in the sociology of religion and legitimation. New 
approaches challenge the linear assumptions of secularization theory (Hadden 
1989) and stress the continuing importance of religion, not as the "Parsonsian 
conscience of the whole" nor the "Bergerian refuge of the periphery,” but as 
“the vital expression of groups": "disestablished, culturally pluralistic, 
structurally adaptable, empowering, and voluntaristic" (Warner 1993:1047).1 

Furthermore, functionalist conceptions of religious legitimations as "control 
system[s] linking meaning and motivation" (Bellah 1970:16) or "the process 
by which one aspect of a social system confers sanctions on society as a whole 
and on particular institutions within it" (Fenn 1974:144) have been replaced 
by a view of legitimation as the outcome of bitterly contested struggles where 
religion still plays a vital role, “a sphere of activity where efforts are deliber- 
ately made to influence, manipulate, and control people's thoughts, feelings, 
and actions in accordance with various religious values" (Beckford 1983:29). 
The relevant agents in struggles over legitimation are now seen as actors and 
groups rather than systems and institutions because “the contestable character 
of legitimation derives from the fact that the uses and functions of power 
always involve participating [human] agents" (Kokosalakis 1985:371). 

Because of the importance of these developments in the sociology of reli- 
gion, we emphasize works in this review that stress the dual role of religious 
legitimations as both world-maintaining and world-shaking, call into question 
the assumptions of secularization theory, and define activism as a central 
concern. Altogether, these works suggest that struggles to legitimate and de- 
legitimate power and privilege have the potential to transform both political 
alignments and institutional arrangements. Once seemingly far removed, reli- 
gion and politics have come to be viewed as more closely entwined (Della 
Fave 1986, McGuire 1983, Schoenfeld 1993). The “publicization of the pri- 
vate” and the “privatization of public matters” both erode the lines separating 
religion from politics (Bourg 1981:298). Consequently, “politics is being rein- 
fused with religious symbols and claims, whereas religion is being summoned 
to the trenches of popular struggle, including even recourse to violent tactics” 
(Falk 1988:392). 

As boundaries blur, new alliances are forged. Marxists and Christians are 
newly allied in parts of Latin America (Dodson 1986) but provoke 
"counter-revolutionary" responses among North American conservatives (Ezcurra 
1983). Also in the United States, some Protestants, Catholics, and Jews agree 
to disagree on theology while uniting on certain shared moral stands, whether 
anti-abortion or anti-racism (Hunter 1991). Finally, images of students holding 
hostages in Iran (Germani 1989) or of "eco-religious" activists aboard the tiny 
Greenpeace “Rainbow Warrior" challenging a French warship to prevent whaling 
(Falk 1988) suggest shifting forms of legitimation struggles, where religious 
voluntary associations challenge the power and policy of nation-states. 


1For a recent empirical study, however, that contends that poll data overestimate levels of church 
attendance and participation in the United States, see Hadaway et al (1993). 

Following the lead of Robert Wuthnow (1988), many sociologists of religion 
contend that these transformations in the lines separating religion and politics 
and the new configurations of allies and opponents engaged in legitimation 
struggles amount to a profound "restructuring" of religious institutions. Below 
we describe research on religious and political restructuring in the United 
States and the world-system. Then we discuss theoretical contributions from 
social movement theory and the sociology of culture, Marxism, and feminism 
that are contributing to new approaches to legitimation struggles. 


PERESTROIKA WITHOUT GLASNOST: THE UNITED 
STATES 


Theories of Religious Restructuring 


Three influential surveys of the national scene (Hunter 1991, Roof & McKin- 
ney 1987, Wuthnow 1988), along with a community study (Demerath & 
Williams 1992) and a denominational survey (Ammerman 1990), suggest the 
broad outlines of contemporary religio-political restructuring and conflict in 
the United States. Each study traces the current phase of religious politicization 
to liberal activism beginning in the 1960s and 1970s that aimed to legitimate 
African-American civil rights, women's rights, gay and lesbian rights, and 
opposition to the war in Viet Nam, as well as to the counter-mobilization by 
evangelical and fundamentalist groups in the late 1970s that culminated in the 
New Christian Right's reputed contribution to the election of Ronald Reagan 
as president. Together, these studies suggest a shift from earlier conceptions 
of religion as an indirect legitimator of the status quo to one that emphasizes 
its directly combative stance vis-à-vis both secular legitimations and reli- 
giously based but competing moral visions of the good society. 

More specifically, studies of religious restructuring call attention to change 
on three levels. First, liberal or mainline denominations have declined in size 
and influence while conservative denominations have grown (Roof & McKinney 
1987). More importantly, however, denominationalism, described earlier by 
Lenski (1961) and Stark & Glock (1968) as an indisputable feature of the 
American religious landscape, has declined in significance as indicated by 
trends toward denomination switching, intermarriage, and ecumenical activism 
(Wuthnow 1988). Finally, direct action, special purpose religious groups have 
become increasingly prominent on the political scene (Hunter 1991). No longer 
content merely to play an indirect role in social and political legitimation by 
influencing individuals' consciences and convictions—that is, unwilling solely 
to influence “values” according to the indirect role ascribed to religious insti- 
tutions by Parsons in “pattern maintenance" (Wuthnow 1988:58)—these new 
tactical organizations have mobilized to transform American culture directly, 
redefining or responding to changing symbolic boundaries between religion, 
morality, and politics. Mobilization by religious liberals or conservatives chal- 
lenges secular forms of legitimation such as technical expertise (Habermas 
1975), but mobilization by the religious right wing has commanded the greatest 
attention. Partly because of such mobilization, the correlation between church 
attendance and voting—positive for religious liberals in the 1960s but negative 
for religious conservatives—was reversed in the late 1970s as previously 
alienated evangelicals and fundamentalists became among the most politically 
engaged Americans (Wuthnow 1988). 

Examining the centrality of special purpose groups to “culture wars" over 
law, government, education, science, family, and sexuality, Hunter (1991:47) 
claims that “pragmatic alliances ... formed across faith traditions” are now the 
key structural elements in contemporary contests over legitimation. Regardless 
of denominational affiliation, adherents to “orthodox” conceptions of “exter- 
nal, definable, and transcendent authority” are united politically against 
“progressives” who “resymbolize historical faiths according to the prevailing 
assumptions of contemporary life" (Hunter 1991:44-45). At root, the conflict 
between differing conceptions of moral authority “is about power—a struggle 
to achieve or maintain the power to define reality" (Hunter 1991:52). 

According to Roof & McKinney (1987:9) the divisiveness among such 
groups has resulted in “the erosion of public faith, the polarization of American 
life, and heightened religious individualism." Although religion may indeed 
have “lost force as an integrative influence in American life" as Roof & 
McKinney (1987:33) contend, “the sheer existence of differing, even hostile 
interpretations," according to Wuthnow (1988:257), "need not result in total 
decay as far as the legitimating power of religious symbols is concerned," 
because religious liberals and conservatives "agree with one another in assert- 
ing the importance of religious values to the political process." 

Descriptions of religious restructuring and the interpenetration of religion 
and politics on the national level are borne out in Demerath & Williams's 
(1992) study of religion and cultural power in Springfield, Massachusetts, and 
in Ammerman's (1990) survey of religious conflict in the Southern Baptist 
Convention. From an examination of community conflicts over homelessness, 
race, and sexuality between public officials ("arbiters of practicality") and 
religious activists ("custodians of principle"), Demerath & Williams conclude 
that long-term secularization evokes at least short-term sacralizing responses 
best described as single-interest rather than civic-wide. The cultural power 
exerted in religious activism relies upon moral tactics mobilized by ecumen- 
ical, special-interest (liberal or conservative) religious organizations. At the 
denominational level as well, effective resource mobilization is the key to 
recent victories by fundamentalists in Baptist battles over the relegitimation 
of biblical orthodoxy, traditional ethics, conservative civil religion, the patri- 
archal family, and the delegitimation of feminism, especially women's lead- 
ership in churches (Ammerman 1990). 

Commentators agree more on the "how" than on the "why" of American 
religious restructuring. Roof & McKinney stress shifts in American values, 
especially individualism, privatism, and a “new voluntarism" attributable to 
the "baby boom" generation, but Hunter (1991:291), in a far more structuralist 
(Durkheimian) mode, claims that the “opposing moral visions" between or- 
thodox activists and progressives represent “a reality sui generis: a reality much 
larger than, and indeed autonomous from the sum total of individuals and 
organizations that give expression to the conflict." In contrast to Hunter (1991), 
Ammerman stresses the contingency of effective mobilization (agency) rather 
than cleavages in the presumptive deep structures of normative reality. 
Ammerman points to the importance of educational differences that divide 
fundamentalists from liberal Baptists. Also, like others (below) who interpret 
fundamentalism as the defense of threatened lifestyles, she demonstrates that 
conservative mobilization has been most effective among less educated Baptists 
who have experienced the unsettling disruptions of regional and rural-urban 
migration. Finally, and most convincingly, Wuthnow relates religious conflicts 
to changes in the symbolic boundaries separating religion, politics, and mo- 
rality caused by historically contingent events (Viet Nam, Watergate, Roe v. 
Wade) and by structural changes in political-economy, including the educa- 
tional revolution, the rise of the "new class," and new forms of state penetration 
into areas of daily life formerly defined as private. "As the state has expanded 
its functions, particularly in areas of welfare, education, equal rights legislation, 
and other kinds of regulation that affect the day-to-day activities of citizens, 
special interests groups have arisen for the express purpose of combatting, 
restraining, or promoting certain types of government activity" (Wuthnow 
1988:114). 

Religious restructuring and the on-going conflicts between religious liberals 
and conservatives influence political legitimation. As Wuthnow (1988:241) 
observes, "if organized religion undergoes a series of restructuring events—if 
it becomes polarized—then the legitimating myths of the nation at large may 
also be subjected to certain tensions and modifications." The fragmentation of 
civil religion is an important example. Unlike Bellah (1975), who related the 
fate of civil religion to secularization and the tension between 
communitarianism and individualism, adherents to the restructuration model attribute the 
decline of civil religion as much to divisiveness among religious groups as to 
secularization. Thus Roof & McKinney (1987:6,13) contend that whereas 
Americans "traditionally viewed the state, the community, the family and the 
marketplace as ‘God-given’,” many now feel “despair and disillusionment” 
since "the old faith that once unified Americans around the celebration of 
national values and purpose [has been] deemed by many to be hollow and 
deceitful.” Similarly, “American civil religion,” according to Wuthnow (1988: 
244), “is [now]... deeply divided ... does not speak with a single voice, uniting 
the majority of Americans around common ideals ... [and] has instead become 
a confusion of tongues speaking from different traditions and offering different 
visions of what America can and should be.”2 

While such religious conflicts undermine political legitimations based on 
civil religion, political conflict also transforms religious legitimations. Wuth- 
now, Demerath & Williams, and Hunter each observe that when religious 
activists struggle to legitimate their moral claims in the public arena, "they 
[find] it necessary to play more by political rules than by religious rules" 
(Wuthnow 1988:207). Such "rules" include procedural rationality—expressed 
in lobbying, PACs, and legislative efforts (Wilcox 19882), appeals to the 
common interest, and (in the battles to advance "scientific creationism" or 
anti-abortion) recourse to knowledge and evidence, each of which encourages 
accommodation to secular norms of rationality and the eventual moderation 
of demands. 

The impact of politics on religious legitimations thus leads many observers 
to predict further long-term decline in the salience of religiously based political 
legitimations in advanced societies. However, even though American political 
authority is justified largely on nonreligious normative grounds (Beetham 
1991), there is little doubt that religious activism currently influences the 
legitimacy of certain policies, the shape of constituencies and coalitions, levels 
of participation, cultural climates, and the social definitions of public and 
private spheres in the United States. 

While religious conflicts may undermine religiously grounded forms of 
political legitimation such as civil religion, they may also influence 
nonreligious legitimations. Thus, research should be done, for example, to 
determine whether right wing demands to prohibit abortion potentially threaten 
legitimations of American political life that conservatives otherwise support 
such as those that valorize freedom of choice as well as legitimations more 
often associated with liberal agendas such as technological rationality. One 
might also investigate whether right wing protests against pornography and 
television violence represent renewed forms of asceticism that call into 
question legitimations of the American way of life that stress the comforts and 
satisfactions of materialism.3 Additionally, researchers should consider 
whether conflicts among religious liberals and conservatives threaten to 
undermine legitimations of the American sociopolitical order that rely upon 
assumptions of the common interest and a unified public will. 


2For an overview of research on American civil religion and a discussion of the timing of its 
diminishing interest among sociologists, see Mathisen (1989). 

Finally, such questions assume that religious activists in the United States 
will remain bound by the prevailing normative rules of public life. However 
likely, this cannot simply be assumed. Some organized actions by religious 
conservatives such as "Operation Rescue" break laws but can be assimilated 
to legitimate traditions of civil disobedience, but some far right agendas such 
as John Rushdoony's “Christian Nation" thesis—accepted in part by the Ari- 
zona Republican Party in 1989 (Midgley 1990)—delegitimate democracy al- 
together in favor of theocratic rule and biblical law, suggesting the possibility 
that at least some religious activists on the far right are motivated to challenge 
radically the rules of the game in American politics. 


Empirical Research on American Religious Activism 


Thus far, we have described works that most effectively advance the theory 
of American religious restructuring by stressing the decline of 
denominationalism, the rise of "special purpose groups" in the sphere of religious activism, 
and the deepening divide between religious liberals and conservatives. These 
positions are both supported and challenged by a vast empirical literature with 
contradictory implications for American legitimation struggles. For instance, 
several studies document increased attention to political issues in the preaching 
of conservative clergy during the late 1970s (Koller & Retzer 1980, Nelson 
& Baxter 1981), but researchers differ in conclusions about how far-reaching 
and effective the New Christian Right (NCR) has actually been. Although most 
researchers now agree that early estimates of massive New Christian Right 
support (Simpson 1983) were exaggerated (Sigelman & Presser 1988), some 
continue to stress the NCR's electoral potential (Diamond 1989, Guth & Green 
1991, Olson & Beck 1990, Wilcox 1989) while others discount, at least, its 
national political impact (Bruce 1993, Jelen 1987, Johnson & Tamney 1982, 
Rothenberg & Newport 1984, Tamney & Johnson 1983). 


3For challenges to political legitimations from the religious left in the United States, see Craig 
(1992) and Hill (1989). 

4Because the relationship between religiosity and politics is found to vary widely by race, much 
of the survey research on religion and politics reports findings for whites only. For a comprehensive 
examination of religion and politics among African-Americans, see Lincoln & Mamiya (1990), also 
Woodrum & Bell (1989). 

Against the argument, advanced most strongly by Hunter (1991), that 
religious restructuring creates enduring alliances across faith traditions are 
studies that find (i) that anti-Catholicism is an important predictor of Moral 
Majority support (Wilcox 1989), (ii) that there is little Moral Majority support 
among Catholics (Shupe & Stacey 1983, also Olson & Beck 1990), and (iii) 
that there are important sociopolitical attitude differences between funda- 
mentalist Protestants and conservative Catholics (Tamney, Burton, & Johnson 
1988), and between charismatic and evangelical Christians (Jelen 1987, Smidt 
1988). 

Additionally, Hunter's argument that conflicts between orthodox and liberal 
believers reflect deep structural normative divisions is called into question by 
contradictory findings on the relationship between orthodox beliefs and support 
for the NCR. [Compare Harper & Leicht (1984) and Tamney & Johnson 
(1983), who found little or no correlation between religious orthodoxy and 
NCR support, with Shupe & Stacey (1983) and Wilcox (1989), who report 
positive relationships.] 

Hunter's deterministic thesis about enduring warfare between competing 
moral visions is also called into question by researchers (Harper & Leicht 
1984, Page & Clelland 1978, Shupe & Stacey 1983, Simpson 1983, Wald, 
Owen, & Hill 1989, Tamney & Johnson 1983) who attribute support for 
New Christian Right positions to more transient political factors. These 
factors include the defense of threatened lifestyles and reactions to declining 
status positions which are likely to diminish as conservative disputants adjust 
to modernization and as educational and socioeconomic attainments among 
evangelicals increase. For an important argument against status defense, 
however, see Wuthnow (1983:184) who contends that "it [was not] necessary 
for evangelicals to experience severe anxieties, as a ‘status politics’ interpre- 
tation of these events would suggest... [once] [t]he reconstruction of sym- 
bolic worlds created a domain in which participation simply became a 
sensible thing to do.” 

Like the findings on orthodoxy, research on the impact of religiosity (church 
attendance) on voting participation, tolerance, and conservatism also results 
in contradictory findings (compare Perkins 1992, Peterson & Takayama 1984, 
Smidt & Penning 1982, Wilcox 1988b). Nevertheless, findings that membership 
in fundamentalist churches (Liebman 1983, Tamney & Johnson 1983, 
Shupe & Stacey 1983, Wilcox, Linzey, & Jelen 1991) and viewership of 
televangelists (Shupe & Stacey 1983, Tamney & Johnson 1983, Wilcox 1989) 


5The literature on the especially aggressive role of the Catholic Church in the politics 
of abortion as well as peace and justice issues, in addition to the ideological gaps between Catholic 
clergy and laity, is too extensive to review here thoroughly, but good discussions and bibliographies 
can be found in Leege & Welch (1989), Penning (1986), and Tamney et al (1988). 
correlate strongly with NCR sympathy support the centrality of resource 
mobilization to restructuring, as stressed most effectively by Wuthnow (1988) 
and Ammerman (1990). 

Here too, however, research suggests that NCR claims of massive mobili- 
zation have been exaggerated. Researchers report less support for NCR orga- 
nizations than expected among Midwestern, Southern, and evangelical pop- 
ulations (Tamney & Johnson 1983, Shupe & Stacey 1983, Wilcox 1989). Smidt 
& Penning (1982) report no increase nationally in the percentage of the population 
self-identifying as “religious conservatives” between 1974 and 1980, and 
Mueller (1983) finds no conservative trend in attitudes toward women's lib- 
eration between 1972 and 1980. 

Although Wilcox (1988b) has demonstrated that special purpose organiza- 
tions such as the Moral Majority help adherents to "see the connection" 
between conservative religious beliefs and political policy, others report con- 
siderable diversity of opinion among NCR supporters, including dissent even 
about whether religion should be involved in politics (Shupe & Stacey 1983). 
Mueller (1983) suggests that while effective political mobilization should have 
produced mass cognitive restructuring, factor analyses of attitudes toward 
women and other measures of conservatism suggest that conservative opinion 
has not coalesced into a single coherent ideological position. A later study by 
Kiecolt & Nelson (1988) shows that religious conservatives still exhibited 
lower levels of consistency of political attitudes than did liberals from 1972 
to 1984 (indicating lower psychological involvement in politics) and that 
conservatives' attitudes were even less consistent at the end of the period than 
at the beginning. Finally, while NCR mobilization has been effective on 
specific issues such as abortion, it has been less successful in shaping opinion 
about foreign policy (Rothenberg & Newport 1984) and free enterprise [see 
Tamney, Burton, & Johnson (1988) and Tamney (1991) who report that work- 
ing class fundamentalists support government intervention into economic re- 
structuring]. 


PERESTROIKA WITHOUT GLASNOST: 
THE WORLD-SYSTEM 


Turner (1991:226) notes that "On a global scale... religion often assumes 
renewed vitality as the cultural medium of political protest against internal 
colonialism by subnational cultures or against external colonialism by subor- 
dinated nationalism." While religious activists in the United States challenge 
institutionalized boundaries between religion and politics but play within es- 
tablished rules of the game, religious insurgency in other zones of the world 
system often challenges not only the legitimacy of specific regimes but also 
Western concepts of secular nationalism. Observers dispute, however, whether 
the latter represents a global trend toward "religious nationalism" 
(Juergensmeyer 1993), or merely a "cyclical phenomenon" (Smith 1990). 

Whichever is the case, it is clear that developing societies, where political 
opposition is often expressed through religious ideology and organization 
(Maduro 1982), are especially prone to legitimacy deficits.6 Beetham (1991:172) 
contends that since the state sectors of developing societies are often 
overdeveloped in relation to the rest of society (because of their leading role 
in development), political struggles there take on “special intensity.” At the 
same time, developing societies are particularly subject to “sharp economic 
inequalities and dislocations” because of economic dependency and capitalist 
penetration. Finally, Beetham (1991:172) points out that, in contrast to the advanced 
industrial societies where political suffrage was restricted during the industri- 
alization phase when capitalist legitimacy was weakest, “late industrializers... 
[are] fully exposed to the potential contradiction between the cost that eco- 
nomic transformation inflicts upon the subordinate classes, and the electoral 
power that the political system formally assigns to them.” Not surprisingly, 
then, considerable attention has been directed to the problematic relationship 
between religion and legitimation in developing societies. 

Just as secularization theories failed to predict the current significance of 
religion in the politics of advanced industrial societies, modernization theory 
too readily dismissed the importance of religious activism in Third World 
political development. Even the legitimacy of states like India, thought to be 
securely secular until recently (compare Buultjens 1986 and Sen 1993), has 
been challenged by religious radicals, while revolutions in Nicaragua and Iran 
have encouraged sociologists of religion to pay particularly close attention to 
developments in Latin America and the Middle East. 


Legitimation Struggles in Latin America and the Middle East 


In Latin America scholars have examined how the Christian base community 
movement (Hewitt 1991) and liberation theology (Berryman 1987) have pro- 
vided ideology, organization, and leadership for challenges to the legitimacy 
of authoritarian regimes and to capitalist exploitation and foreign dependency 
in several countries (Levine 1979), but especially Nicaragua (Girardi 1988, 
1989, Lancaster 1988). They have also devoted attention to cross-national 
variations in the behavior of high level Catholic officials in Latin America, 
emphasizing their efforts “to serve as a mediator and reconciler of contending 
forces” (Crahan 1991:135), to adopt “an openly oppositional stance to the 
status quo” as in Brazil (Neuhouser 1989a), or to delegitimate revolution as 


6As recent events dramatically reveal, communist political systems have also faced unique 
legitimation crises. On the role of religion in both the legitimation and delegitimation of communist 
regimes, see Beetham (1991), Pankhurst (1986), Ramet (1989a), Weigel (1992), and Zuo (1991). 
in El Salvador and Nicaragua (Blee 1984). Additionally, scholars have paid 
attention to counter-revolutionary efforts in the region by conservative Prot- 
estants (O'Shaughnessy 1990, Diamond 1989). 

Catholic-based opposition to authoritarian regimes in Latin America—as 
well as the Philippines (Tate 1990), Poland (Weigel 1992), and Franco-ruled 
Spain (Perez-Agote 1986)—tends to legitimate liberal or socialist democracy, 
but the relationship between religion and politics among Muslims, command- 
ing at least as much scholarly attention, is far more diverse (Esposito 1983, 
Butterworth & Zartman 1992). Here, divergent tenets derived 
from, or attributed to, Islamic traditions are called upon to legitimate, for 
instance, traditional rule in Saudi Arabia (Beetham 1991), theocratic 
nationalism in Iran (Arjomand 1986), or political insurrection in Israeli-occupied 
territories (Legrain 1991). Attempts to substitute Islamic law (the Shari'a) for 
secular (Western-inspired) legal institutions take on obviously different polit- 
ical meanings when imposed from above to shore up military regimes, as in 
Pakistan and the Sudan (Meyer 1993), or demanded from below, as in Egypt 
(Baker 1991). In addition to noting the centrality of Islamization to blocking 
Western cultural imperialism, Meyer (1993:111) also observes the diversity 
of political positions that Islamization may be called upon to legitimate: 


In some cases, support for returning to the Shari'a could mean that Muslims are 
calling for the realization of political and economic goals such as honest and 
democratic government, accountability of public officials, social justice, and the 
redistribution of wealth, or the establishment of an egalitarian society. In another 
context, calls for Islamization could signify a pattern of hostile reactions to 
modernization measures and a commitment to buttressing the patriarchal family, 
expanding religious instruction in public schools, imposing discriminatory mea- 
sures on religious minorities and Muslim dissidents, challenging the legality of 
land reform programs, or preventing women from working and serving in public 
office. 


Comparative Studies 


We point to diverse interconnections between religion and politics in Latin 
America and in Muslim societies simply to illustrate why historical and com- 
parative analysis is so essential today for the development of theory, but, as 
Demerath (1991:38) complains, “Religion and its relation to society have too 
often been explored through the dark lens of single societies treated singularly." 
Our search for well-designed comparative studies has been disappointing, but 
a few promising efforts, and one possibly misleading strategy, deserve com- 
ment. 

The description above of divergent reactions among national Catholic 
hierarchies to opposition in Latin America suggests one obvious but 
underdeveloped avenue for comparative research. Though reductionistic, Neuhouser 
(1989a) relates different orientations toward the opposition to authoritarian 
regimes among Catholic elites in Argentina, Brazil, Chile, and Colombia to 
national differences in competition (from Protestants and Leftist parties) and 
organizational resources (proportions of priests in the population and levels of 
state support for the church). Similarly, in a comparison of Catholic opposition 
to authoritarian regimes in Nicaragua, Franco's Spain, and Poland, Johnson & 
Figa (1988) relate oppositional opportunities to variations in regime needs for 
external legitimation and contextual variations in economic development and 
differentiation as well as to levels within church authority structures. Finally, 
in a related comparison of state delegitimation in Catalonia and Poland, John- 
son (1989) examines churches’ roles in sustaining “religio-oppositional sub- 
cultures" through domestic, educational, and youth associations, with obvious 
implications for the current situation of national fragmentation in former Yu- 
goslavia (though dated, see Ramet 1989b). 

Unfortunately, comparative research on legitimation activities across world 
religions is less often deployed. However, Lincoln (1985) has surveyed a vast 
number of comparative and historical examples to suggest four useful ideal 
types of legitimation efforts: religions of the status quo, religions of resistance, 
religions of revolution, and religions of counter-revolution. Even more pro- 
vocatively, Wuthnow (1980) has broadly suggested how six religious move- 
ments (revitalization, reformation, militancy, counter-reform, accommodation, 
and sectarianism) may be related to spatial zones (core and periphery) and 
temporal phases (expansion, polarization, and reconstitution) of world-system 
development. 

Three other theory sketches deserve mention. By comparing four nominally 
religious states (Indonesia, Pakistan, Sweden, and Thailand), Demerath (1991: 
37) offers suggestive insights into religion's impact via governance vs legiti- 
mation, concluding that "religion's capital is often maximized when it is not 
a capital religion," and that “movements on the fringes of society sometimes 
exert disproportionate influence by refusing the path of compromise and 
co-optation." Henry (1986) shows how the changing legitimacy demands of 
colonialism and post-colonialism, in conjunction with three structural variables 
(degree of world-system incorporation, historical patterns of colonization, and 
relationships between religious organizations and national power) have af- 
fected the oppositional potential of a new religious movement (Rastafarian- 
ism), a transformative movement (Latin American liberation theology), and a 
revitalization movement (Islamization). Finally, Budde (1992) contrasts the 
politics of Catholicism between core and peripheral zones of the world system 
to describe the differentiation of “two churches"— conservative and radical— 
within Roman Catholicism. 

These comparative approaches (all but Budde's, sketchy at best) have the 
virtue of relating religious agency, organization, and ideology to complexities 
of historical contingency and social structure. Unfortunately, this may prove 
to be less true of another approach, the comparative study of “fundamental- 
isms." 


Comparative Studies of Fundamentalism 


While many scholars are contributing independently to the study of 
fundamentalism (see Hadden & Shupe 1989a), the Fundamentalism Project of the 
American Academy of Arts and Sciences, directed by Marty & Appleby (1991, 
1993), involves a large team of international experts. What unites their study 
of contemporary religious radicalism within the widely divergent traditions 
of Protestantism, Catholicism, Judaism, Hinduism, Buddhism, Sikhism, and 
Islam is the working hypothesis that "there are 'family resemblances' among 
disparate movements of religiously inspired reaction to aspects of the global 
process of modernization and secularization in the twentieth century" (Marty 
& Appleby 1993:2). This shared reaction, fundamentalism, is identified as a 
"tendency, a habit of mind... [that] manifests itself as a strategy or set of 
strategies by which beleaguered believers attempt to preserve their distinctive 
identity as a people or group" (Marty & Appleby 1993:3). 

Forms of religious activism as diverse as "Operation Rescue" in the United 
States, the anti-Soviet war of resistance in Afghanistan, and the Lebanese 
“Party of God" (the Hizbullah) are thus assimilated under the rubric of “fun- 
damentalisms" (see contributions in Marty & Appleby 1993). Even within 
pluralistic religions like Hinduism and Buddhism that defy fundamentalistic 
codification, radical tendencies are labeled “fundamentalist” on the basis of 
"orthopraxis" rather than orthodoxy (see Marty & Appleby 1991). 

By comparing Protestant fundamentalists in the United States, the Gush 
Emunim and Haredim in Israel, and the Muslim Brotherhood in Egypt, Marty 
& Appleby (1992:3, 4, 33, 23, 24) conclude that fundamentalism is an "inten- 
tionally disruptive” style of “religiopolitical activism" that responds to per- 
ceived identity-threats posed by modernity with "community building," the 
strategy “to scandalize," a "need for absolute authority," and a “will to rule.” 
The last tendency is viewed as especially problematic where there are no 
constitutional separations between public and private spheres or where funda- 
mentalists from competing traditions claim the same "identity defining sacred 
spaces" as do Jews and Muslims in Israel or Hindus and Muslims in parts of 
India. Fundamentalism does not necessarily result in "antidemocratic, 
antiaccommodationist, and antipluralist" policies, however, since politics inevitably 
involves "some measure of compromise and accommodation," though the 
latter are more likely in "democratic or quasi-democratic societies" than in 
“sectarian states" (Marty & Appleby 1993:5, 6, 641). 

Many individual case studies in Fundamentalisms Observed and 
Fundamentalisms and the State are valuable contributions to the literature on religion 
and politics. Additionally, several contributors make interesting comparisons 
across fundamentalisms, in addition to those of the project directors. For 
example, Bruce (1993) compares tendencies toward compromise and accom- 
modation among Protestant fundamentalists in situations where they are an 
ethnic minority (Ulster) and where they are not (the United States). Garvey 
(1993) usefully distinguishes the motivations and goals of fundamentalist 
movements that react to changes, threats, or opportunities in national identity 
from those that react to the state penetration of private spheres. He also shows 
that militancy is inversely related to regime strength. Despite such 
contributions, however, we find several aspects of the Fundamentalism Project, 
especially as its directors define it, quite troubling. 

First, by identifying fundamentalism as a cultural response (a "state of 
mind") to cultural phenomena (“modernism”), the approach is reductionistic. 
It abstractly removes "fundamentalists"— if, indeed, such generic actors exist 
cross-culturally (compare Swatos 1993, Hadden & Shupe, 1989b)—from the 
concrete historical situations within which they struggle, and it defines the 
object of their struggle equally abstractly, e.g. as “modernism.” Similar to 
neofunctionalists who interpret fundamentalism as modern antimodernist cul- 
tural movements aimed at “value-oriented dedifferentiation” (Lechner 1993, 
also Robertson 1986), the approach further assumes that the political and 
religious institutions fundamentalists attack were not already deeply inter- 
twined in the societies being investigated (see Levine 1990). 

The cultural reductionism of this approach stands in contrast to Germani’s 
(1989: 464) interpretation from the standpoint of world-systems theory of 
Islamic fundamentalism in general and hostage-taking in particular as forms 
of resistance to international imperialism, e.g. “home grown dependency 
control approaches" rather than “random act[s] of violence or [the] archaic 
idea[s] of a crazy clergy." Germani's approach is reductionistic in its own 
way, but in fact it shows how differently fundamentalist activism is viewed 
when seen, at least in part, as a response to dependency in the capitalist 
system with all its history of exploitation, oppression, and violence, rather 
than as a response to that "modernity" which, by definition, only the 
"backward" would oppose. 

Second, the approach is essentialistic; its definition of fundamentalism is so 
broad and general that important distinctions among types of religious activism 
are blurred. Again, examples from the Middle East suggest what is wrong. 
Said (1979) has brilliantly demonstrated how essentialistic thinking has dis- 
torted the West's understanding of the Middle East for centuries, and Naderi 
(1990; also Turner 1974) shows that classical sociological discussions of the 
region in Marx and Weber are similarly distorted. Thus, one finds many 
repetitions in sociology (for instance, Kimmel 1989, Kokosalakis 1985) of the 
facile assertion that religion and politics are inseparable in Islam, even though 
Arjomand (1984) among others has shown this to be historically inaccurate. 
According to Baker (1991:44), “This notion handily compresses an astounding 
variety of behavior in the Muslim ‘arc of crisis’ into a phenomenon for which 
unitary causes might be sought. Everything from popular Islam, state-spon- 
sored official Islam, through Islamic mysticism, the Islam of the moderate 
center, and onto the proliferating radical movements come under this um- 
brella." 

Specifically in the case of Iran, the designation of Khomeini and the Iranian 
revolution as fundamentalist—to which many scholars object (Esposito 1986) 
—diverts attention from the long-term roots of revolution there (Keddie 1981), 
the social basis of clerical power (Arjomand 1986, 1988), the contingent 
factors explaining its hegemony during the 1978 revolution (Salehi 1988), the 
diversity among religious activists (Chehabi 1990), and the blend of tradition- 
alism and pragmatism in post-revolutionary policy (Yeganeh 1993). Although 
Marty & Appleby (1992:32, 182) affirm that fundamentalists can be “innovative 
world-builders” and “inventors more than preservers,” the label nonetheless 
obscures how politically innovative Khomeini’s “revolutionary traditionalism” 
and his “Mandate of the Jurist” were (Arjomand 1993), as well as, in general, 
how deeply Islamization is influenced by secular models of ideology and 
constitution-making (Arjomand 1989). Finally, the conclusion 
that in Iran (as elsewhere) “when Islamic Fundamentalism is adopted as a 
government policy, it tends to accentuate the antidemocratic, authoritarian, and 
repressive tendencies in the local political systems” (Meyer 1993:142) attaches 
too much causal weight to fundamentalism per se and not enough to the impact 
of social structural legacies that postrevolutionary regimes inherit whenever 
they attempt to transform autocratic political systems (McDaniel 1991). 

Finally, we suspect that what chiefly motivates this approach is the fear that 
fundamentalism’s essential telos is its “will to rule.” Hinted at in Meyer’s 
assertion (above), this fear becomes explicit when Marty & Appleby (1992: 
195) refer to “the hope that many secularists and religious liberals hold... that 
radical fundamentalists or other extremist imaginings will be contained by the 
rules of a social contract that allows for a vigorous competition between 
different models of society.” Concluding their analysis of fundamentalism with 
a recitation of the genius of American pluralism, they assert that “the key to 
America’s success has been its ability to endorse competition as lawful, indeed 
as the very fibre of political and economic life, and to provide assurances... 
that within that competition the losers retain full civil and political rights and 
participation...” (Marty & Appleby 1992:194). Celebrating the orthodoxy and 
orthopraxis of liberalism while ignoring mounting evidence that the legitimacy 
of liberalism itself is being eroded (Habermas 1975), the authors cause us to 
wonder whose fundamentalism and whose “will to power” struggle for 
hegemony here.7 


The Future of Religious Legitimations 


To conclude our discussion of religious restructuring in the world system, we 
note that scholars who adhere to the modernization model insist that religious 
activism, and fundamentalism in particular, is “inevitably coopted” by its 
struggle against modernity (Lechner 1993:31). According to Smith (1990:44), 
“oppositional politics is the forte of religion" but "[t]he most basic limit of 
religious resurgence is in moving beyond the oppositional role to institution- 
alize some vision of a modern religious state.” Whether this is true, however, 
remains an open question. Beetham (1991) has reformulated Weber’s theory 
of legitimation to include three verifiable components—legal validity, rule 
justifiability, and expressed consent; none of these preclude religiously based 
political orders in principle from being legitimate. While it is obligatory for 
critical social scientists to evaluate political legitimacy claims in such terms, 
the most crucial evaluations are those made collectively by the subjects of a 
political order. Only political practice will determine whether religion will in 
fact play a constitutive role in the creation of legitimate political life in the 
contemporary world-system. 


THEORETICAL RESOURCES 


New Approaches to Social Movements and Culture 


SOCIAL MOVEMENTS We have reviewed research on national and global 
levels that contributes to a post-functionalist reconceptualization of legitima- 
tion as effort rather than systemic process or mechanism. Fluid conceptualiza- 
tions of religion and politics have displaced older approaches to church-state 
relations or at least caused them to be rethought from a standpoint that focuses 
attention on the activity of groups rather than systems (Levine 1990). This 
approach to legitimation as effort and activity benefits from theoretical 
resources derived from new approaches both to social movements and to culture. 

Resource mobilization theory and “new social movements” theory have 
been utilized in, or are relevant to, most of the studies reviewed above. 
Although these approaches are too extensive to be reviewed here thoroughly, a 
few of their key premises should be noted since they have provided important 
conceptual tools for sociologists of religion. In contrast to older conceptualizations 


7In contrast to the view that liberal-democratic ideology evolved naturally as the secular 
extension of Protestantism, Zaret (1989) argues that it arose as opposition to religious radicalism in 
seventeenth century England. Zaret’s analysis suggests interesting affinities between liberal 
ideology then and its opposition to fundamentalism today. 
of social movements as collective, often irrational, responses to sociocultural 
strain or breakdown, resource mobilization theory assumes that social discon- 
tent is both rational and ever-present. Stressing objective variations in such 
factors as interests, resources, opportunities, leadership, and organization 
among movements, the theory assumes that what stands in need of explanation 
is not participation per se (as in the status defense studies reviewed above) but 
the relative success of strategic efforts to mobilize discontent (Jenkins 1983). 

As we have shown, Wuthnow (1988), Hunter (1991), and Ammerman 
(1990) use resource mobilization theory to highlight the increased prominence 
of special-purpose groups in North American religious activism. Among stud- 
ies cited earlier, Liebman (1983), in particular, deployed the theory in a highly 
effective comparison of the relative success of four New Christian Right groups 
seeking to influence national politics in the United States. He showed that 
recruitment among pre-existing collectivities (especially Baptist churches), 
more than differences in financial resources, communications networks, in- 
cluding televangelism, or organizational expertise, explained the relative suc- 
cess of the Moral Majority movement. Others have applied the theory to 
explain left-wing religious activism in the United States including the sanctuary 
movement (Wiltfang & McAdam 1991) and the role of African American 
churches in the southern civil rights movement (Morris 1984). 

Social movement theorists in the United States have also begun to suggest 
the utility of resource mobilization theory for understanding religious move- 
ments such as Islamic militancy within the world system (Snow & Marshall 
1984). Smith (1991) has applied a variant of the theory to show how variations 
in political opportunity, insurgent consciousness, and organizational strength 
mediate the impact of sociocultural change and thus influence the development 
of liberation theology in Latin America. Diamond (1989) has shown how the 
US national security establishment has mobilized the resources of North Amer- 
ican evangelical and pentecostal Christians for counter-insurgency there as 
well. 

Resource mobilization theory, described as having a "strategic-instrumental 
bias," works best for explaining how preexistent groups struggle for power 
and privilege but is less effective for explaining the emergence of new forms 
of collective identity, consciousness, and solidarity (Cohen 1985:683). The 
theory of “new social movements," developed largely by European sociologists 
to conceptualize newly emergent forms of activism such as feminism and 
ecology that forge new social identities and assert new cultural definitions in 
civil society, treats the latter as "accomplishments" rather than "givens" of 
social movements (Cohen 1985:685). Although often seen as antithetical, 
resource mobilization theories and new social movement theories are comple- 
mentary. 

Although developed primarily to focus on democratic reform movements 
in the economically advanced societies, the theory of new social movements 
has proven useful for understanding religious movements in general (Hannigan 
1993, Beckford 1989). We have already examined studies that interpret funda- 
mentalist movements as reactions to threatened identities and lifestyles. While 
resource mobilization theory captures the strategic aspects of such movements, 
new social movement theory illuminates how fundamentalist movements, for 
instance, both defend and modify social identities in the course of mobilization. 
This can be shown by reexamining Wilcox et al’s (1991) study of premillen- 
nialist Protestants in the Moral Majority movement. The authors do not ex- 
plicitly use new social movement theory, but they nonetheless show that 
although adherence to a premillennialist theology decreases members’ beliefs 
in the efficacy of political action, their belief that Satan poses an active threat 
to the United States nonetheless propels them into action as what the authors 
term “reluctant warriors.” This new identity as “reluctant warriors” is forged 
by participation in the Moral Majority movement which, in other research, 
Wilcox (1988b) has shown to be instrumental in mobilizing participants to 
“see the connections” between religious beliefs and political activity. 

In addition to mobilizations among religious progressives or conservatives 
that aim directly to influence policy-making or define political reality, other 
forms of religious-based social movements indirectly influence legitimation 
efforts. Especially in developing societies, “new religious movements” 
contribute to both political opposition and accommodation. Although this 
literature is too vast to be reviewed here systematically (see Beckford 1986), a few exemplary 
studies can be noted. In addition to Henry’s (1986) discussion of oppositional 
elements in Ras Tafari (above), Glazer (1986), for instance, examines the 
transformation of Vodum, Spiritual Baptism, and Ras Tafarianism movements 
in the Caribbean from opposition to accommodation, and Pereira de Queiroz 
(1989) explains how Umbanda, an Afro-Brazilian cult, functions as a form of 
civil religion by constructing national identification among a marginalized 
urban population group. In a work that recalls classic studies of messianic and 
millenarian cults among colonized peoples by Lanternari (1963) and Worsley 
(1968), Jules-Rosette (1989) shows that syncretic “religions of oppression” in 
Africa variously become movements of protest, opportunity, or mobility. Al- 
though most frequently deployed to interpret movements in the advanced 
societies, new social movement theory potentially helps to bring the identity 
formation aspects of such new religious movements into sharper focus, high- 
lighting this element of legitimation struggles in post-colonial situations (Han- 
nigan 1993). 


CULTURE New approaches to the sociology of culture complement social 
movement theory because they too stress the role of human activity rather than 
system agency in legitimation. Moving away from older cybernetic models 
that depicted culture as the unmoved mover of social life as well as those that 
reduced culture to values, these newer works stress, instead, the production of 
culture by social actors. According to this view (Swidler 1986:284), “a culture 
has enduring effects on those who hold it, not by shaping the ends they pursue, 
but by providing the characteristic repertoire from which they build lines of 
action.” 

Important differences in the approach to legitimation result, nonetheless, 
from whether researchers choose to stress structural aspects of the cultural 
“repertoire” or the active construction of “lines of action.” Thus, for example, 
Hunter (1991), as we have already shown, stresses structural differences be- 
tween two opposing orientations toward authority that he sees “expressed” in 
America’s cultural wars. Likewise, contending that “the cultural life of society 
can be visualized as a web of intertwining sets of binary relations,” Alexander 
& Smith (1993:157) depict American civic discourse as having been shaped 
throughout two centuries of development by the interplay of two underlying 
symbolic codes, one “democratic” and the other “counter-democratic.” 

In contrast to such semiological approaches in which culture appears to 
produce its own objects, other researchers develop constructionist approaches 
to culture that stress social actors’ active role in the definition, interpretation, 
and negotiation of social reality. In a paradigmatic study of religion as cultural 
power, Williams & Demerath (1991:417, 418) interpret American civic culture 
as “an open text” that provides “resources used differently by groups to 
interpret and legitimate their places and agendas within the community.” The 
authors show concretely how two cultural themes, civil religion and the sep- 
aration of church and state, are variously enforced, compromised, or negotiated 
in political struggles between public officials and challengers. 

Likewise, other scholars have shown that identical religious traditions may 
be used to legitimate quite contrary political objectives. For example, Fulbrook 
(1983) has explored how Puritanism was used to legitimate absolutism, polit- 
ical quietism, and political opposition to established authority in Prussia, 
Württemberg, and England, respectively. Billings (1990) has shown that 
evangelical Protestantism in the American South was used to blunt textile workers’ 
activism while the same traditions contributed to coal miners’ militancy. Power 
and circumstance determined which ends were served. Finally, Keddie (1985) 
has shown that observances of an important religious holiday in Shi’i tradition, 
the celebration of the martyrdom of Husain, changed from being a normally 
quietistic event to a rebellious one during the Iranian Revolution. “A single 
doctrine or doctrinal element,” she notes (1985:161), “thus can change from 
rebellious to quietist, quickly or slowly, and when one knows enough of the 
circumstances one can often explain the nonideological forces behind these 
changes.” 

Values, beliefs, and attitudes—predominant elements in the functionalist 
model of culture—still continue to be important, however, in the sociology of 
religion, as our review of research on the New Christian Right has indicated. 
In that research, as in comparative studies, their utility depends upon how they 
are deployed by researchers. In a work exhibiting the worst effects of essen- 
tialism, Verbit (1978-1979) traces persistent impediments to peace in the 
Middle East to unmediated effects of antithetical forms of telos in Judaism 
and Islam, which he characterizes as "sovereignty" and “hegemony,” respec- 
tively. Apparently influenced by post-functionalist approaches to culture, 
Marty & Appleby capture important performative and constructionist elements 
in fundamentalism when they stress fundamentalists' inventiveness and their 
use of "scandalizing" rhetoric to demarcate identity boundaries. But we criti- 
cized them (above) for reducing and essentializing fundamentalism as a "state 
of mind." 

New theories of social movements and cultural production thus both con- 
tribute to the study of legitimation by providing conceptual resources that 
heighten the understanding of religious agency, but they are only as effective 
as are the interpretations of historical context and social structure within which 
they are deployed. The latter, however, require macrosociological resources 
external to both approaches. 


Macro-Sociological Contributions of Western Marxism 


WORLD-SYSTEM THEORY World systems theory is especially promising for 
helping to conceptualize the global context of religious legitimation struggles. 
Some sociologists of religion rightly criticize the approach for its 
neglect of religious activism in global dynamics, but they err in 
reconceptualizing the global field of struggle as one defined principally by awareness and 
values (Robertson 1992, Swatos 1993). As we argue above, religious activists 
struggle within the capitalist world-system, not within abstract spheres of 
modernism or globalism. 

Following the lead of British cultural theorists such as Raymond Williams 
and social historians such as EP Thompson and Christopher Hill who have 
helped to purge Marxism of cultural reductionism, Maduro (1982:80) has taken 
steps to open world-systems theory to religious research. By conceptualizing 
the “relative autonomy" of religion, he shows that “the religious area is always 
situated in a determinate social context that limits it and orientates it, and hence 
renders the specific autonomy of religion relative to that context.” While 
liberation theologians have taken giant strides toward developing theologies 
that address the moral implications of their concrete situations within global 
capitalism, sociologists have taken only first steps toward a similar 
contextualization of religious activism (Budde 1992, Henry 1986, Germani 
1989, Wuthnow 1980). 


HABERMAS An alternative Marxist approach, Jürgen Habermas's revision of 
critical theory, is helpful, however, for contextualizing national religious 
conflicts over legitimation. Habermas attempts to overcome Marxist reductionism 
by reconceptualizing sociocultural evolution as societal learning on two inter- 
connected levels, world mastery (instrumental rationality) and social interac- 
tion (communicative rationality). In his most recent formulation (Habermas 
1984, 1987), instrumental rationality originates in the sociocultural lifeworld 
(the site of cultural reproduction, social integration, and socialization) but 
develops autonomously through the guidance of self-steering system mecha- 
nisms (markets and bureaucratic decisions). At the same time, rationalization 
of the lifeworld (the “linguistification of the sacred”) increasingly frees social 
interaction from historical forms of dogmatism such as religion. 

Modernization, according to Habermas, opens advanced societies to the 
possible expansion of democracy, but their communicative potential (“discur- 
sive will formation”) remains thwarted by the uncontrolled operation of in- 
creasingly autonomous economic and political systems that increasingly 
penetrate and “colonize” the lifeworld. System crises originate principally in 
capitalist economies but are displaced to states. When the latter are unable to 
accomplish the contradictory tasks of facilitating economic growth and 
securing mass loyalty through social welfare, disturbances arise in the lifeworld as 
loss of meaning (legitimation crises), diminished social obligations (crises of 
anomie), and psychopathologies (crises of motivations). Religious activism 
can be interpreted as a defensive reaction of the lifeworld to such crises. Thus 
in an empirical application of Habermas's model to conservative religious 
movements, Fields (1991:175) argues that “activist fundamentalism” can be 
interpreted as a "response to efforts to deal with contradictions within social 
systems and to struggles among the state, economy, family, and religion to 
control those efforts." 

Habermas's work continues to gain influence in the sociology of religion. 
Like Bellah (1975) before them, contributors to the theory of American re- 
structuring draw upon his early work (Habermas 1975), especially, to explore 
competition between moral and religious legitimations and technocratic ones. 
His more recent work, however, affords a deeper structural understanding of 
why the symbolic boundaries between religion, politics, and morality are in 
flux and why new social movements arise where they do, on the "seams" 
between systems and lifeworlds. But there are drawbacks to Habermas's ap- 
proach. Contributors to the new sociology of culture (Zaret 1992) and feminists 
(Fraser 1989) both criticize his conceptualization of power as external to the 
lifeworld before "colonization." Additionally, besides possibly being West- 
centric, his evolutionary approach assumes secularization and dismisses reli- 
gion as a progressive influence (Rothberg 1986), treats movements as largely 
reactive (Cohen 1985), and possibly confines the value of his insights to 
advanced societies. However, Larry Ray’s (1993:134) use of Habermas’s 
theory to interpret the Islamic revolution in Iran as “an etatist, modernizing, 
authoritarian social movement, originating in the legitimacy and mobilization 
problems of the peripheral state,” suggests that Habermas’s approach can 
fruitfully be applied to movements in developing societies. 


GRAMSCI Other scholars have turned to the political sociology of Antonio 
Gramsci (1971) to understand macro-sociological contexts of religious activ- 
ism. At the center of Gramsci’s sociology was the concept of “hegemony.” 
Always backed up with physical force, “hegemony” refers to the moral and 
intellectual leadership exercised in society—whether for liberation or oppres- 
sion—by dominant groups (or those struggling for dominance) whose authority 
is grounded in economic, political, cultural, and social institutions. In contrast 
to sudden “wars of manoeuvre" that capture state power, such as the Bolshevik 
revolution in Russia, Gramsci stressed long-term “wars of position" within 
civil society as the most important political and cultural struggles for the hearts 
and minds of modern populations. He pointed to the central role that intellec- 
tuals play in such struggles by formulating political world views that articulate 
the needs and interests of subaltern classes, thereby winning their consent to 
be governed by hegemonic classes. 

Gramsci's ideas about the role of religion in civil society and political 
change, over-viewed by Fulton (1987) and Mansueto (1988), have proven 
useful in a variety of historical contexts. One of the most influential applica- 
tions is Genovese's (1976) study of the hegemonic power of American slave 
holders and of the centrality of religion to the forms of accommodation and 
resistance that African-American slaves used in constructing an oppositional 
culture and identity. Gramsci's analyses of how clergy may serve either as 
“traditional” or “organic” intellectuals, blocking or promoting change, have been 
used to interpret the spread of liberation theology in Latin America (Adriance 
1985), accommodation and resistance among American industrial workers 
(Billings 1990), conflicts between clergy and other intellectuals during and 
after the Iranian Revolution (Mohseni 1992), and intellectual partisanship in 
"culture wars" in the United States (Hunter 1991). Others have fruitfully used 
his concept of hegemonic struggles to examine the role of the Shari'a in the 
genesis of the Irano-Ottoman historical bloc (Kusha 1990-91), Western atti- 
tudes toward Muslim culture (Said 1979), class collaboration in African mis- 
sionary efforts (Miller 1990), struggles against caste in India (Pillai 1982), the 
role of religious beliefs and rituals in anticolonial resistance among the 
Ilparakuyo and Masai peoples of Tanzania and Kenya (Rigby 1989), and left-wing 
religious activism in contemporary Italy (Girardi 1988). 

Like the frequent saying that Unitarians are religionists who have lost their 
faith in God but retain their habit of going to worship, Gramsci's thought can 
be appropriated by post-Marxist theorists who wish to remain within the 
horizon of the Left’s emancipatory project but for whom the sacred covenant 
with the proletariat has been broken. Anticipating insights developed more 
recently by resource mobilization theory and new social movement theory but 
combining these in a macro-level theory of politics, Gramsci conceptualized 
the process of hegemony as not simply the formation of political allegiances 
among preformed identities but the articulation of new political identities as 
well (Laclau & Mouffe 1985). Thus, his approach seems especially fruitful for 
conceptualizing forms of religious activism that are not directly reducible to 
class identification, but which nonetheless respond to and help shape devel- 
opments in the capitalist world. 


Gender Analysis 


Studies that make gender both a topic of analysis and a conceptual resource 
also deepen the understanding of religion’s potential to legitimate or challenge 
patriarchal political relations, both within the family and in the broader political 
structure as well. Many of the studies of fundamentalism that we have already 
reviewed, for instance, call attention to fundamentalists’ defense of patriarchy 
in the United States (Hunter 1991, Ammerman 1990) and in developing soci- 
eties (Marty & Appleby 1992, Meyer 1993). In addition, there is a large body 
of quantitative survey findings concerning the impact of religion on sex roles, 
gender ideology, public opinion, and political action. Space limitations prohibit 
an exhaustive survey of this literature, but a few recent examples illustrate the 
contradictory but provocative thrust of this research. 

Survey research reports that in the United States fundamentalism predicts 
preference for patriarchal family forms more so than religiosity (Grasmick et 
al 1990), that the relationship between religiosity and traditional gender ide- 
ology is mediated by education (Barrish & Welch 1980), and that employment 
is closely linked with feminist attitudes among white Catholic and Protestant 
women, except for Evangelical Protestants who continue to give traditional 
family roles priority over career demands (Wilcox & Jelen 1991). In surveys 
focusing upon the extra-familial political realm, Rinehart & Perkins (1989) 
report that religion is no more a force for conservatism among American 
women than men, as previously thought. Yet Himmelstein (1986) argues that 
religion is the single most important variable accounting for antifeminist 
attitudes among women. Even more puzzling, Mayer & Smith (1985) find that 
in Europe, where women’s greater religiosity functions to subvert Left-feminist 
agendas, feminists are no more likely to vote for progressive candidates than 
for centrist and conservative ones. 

In questioning the epistemological basis of quantitative studies, a second 
group of qualitatively based studies embraces and explores the contradictory 
relationship between religion and patriarchal legitimation disclosed through 
the survey-based analyses. In so doing, these authors advance both the 
post-functionalist reconceptualization of legitimation as a contested process and the 
feminist exploration of gender as a sociocultural construction. For example, 
several studies examine how American women reconcile their changing social 
roles and increasingly feminist orientations with ongoing commitments to 
androcentric, patriarchal religions. Among them are investigations of how 
Orthodox Jewish (Kaufman 1989), evangelical Christian (Stacey & Gerrard 
1990, Stacey 1991), and charismatic Christian (Rose 1987) women make "feminist 
sense" out of their religious beliefs, negotiating nontraditional sex roles and 
securing new avenues of power for themselves within the context of male- 
dominated institutions with ostensibly patriarchal ideologies. 

Post-functionalist and feminist studies such as these advance our 
understanding of the relationship between the social construction of religious 
cultures and legitimation by showing how religious rhetoric and discourse are 
used both to construct new models of gender and to resolve tensions between 
patriarchal and anti-patriarchal ideology and practice. Jablonski (1988) for 
instance, has shown how discursive and ritual practices contribute both to 
accommodation and resistance among Catholic feminists, and McMillan 
(1988) has explored how liberal Protestant denominations alter rhetoric and 
ritual to accommodate to feminist challenges to androcentric language. Law- 
less (1988) and Gilkes (1985) show that pentecostal churches tend to marginal- 
ize women yet often allow them more leadership opportunities and discursive 
space than mainline denominations do. However, Scott's (1993) analysis of 
the failure of an Appalachian pentecostal revival to bring apostate women back 
into the church shows how deeply patriarchy penetrates working class churches 
that, from a Marxist perspective, appear to challenge capitalist hegemony. 

Finally, such studies also inform cross-cultural analyses of religious 
movements and politics. Thus, Finkler (1981) found that a Mexican healing cult 
was not simply an expression of women's powerlessness in society but also 
an avenue for power and leadership in the public domain. In Muslim societies, 
Mernissi (1977) described Moroccan saint sanctuaries as "anti-establishment 
arenas" where women escaped male-defined reality and, more recently, 
Kousha (1992) and Yeganeh (1993) document both women's contribution to 
the Iranian revolution and their post-revolutionary resistance to the current 
regime. Neuhouser (1989b) shows how participation in the Brazilian base 
community movement creates new opportunities for women to exercise public 
leadership and resist domestic patriarchy, while Burdick (1990) and Gill (1990) 
claim that Latin American pentecostal movements, though failing to oppose 
class and gender oppression, nonetheless empower women participants. Sim- 
ilarly, Cucchiari (1990) delineates Sicilian pentecostalism's patriarchal and 
anti-patriarchal elements, arguing that this religion has facilitated the negoti- 
ation of a new gender system in Sicily. 


CONCLUSION 


We have examined the development of recent post-functionalist studies of the 
relationship between religion and legitimation in the United States and the 
world system. Giving equal attention to the role of religion in the twin political 
processes of accommodation and resistance to inequality, these works chal- 
lenge linear assumptions about the differentiation of religion and politics and 
religion's diminishing role in legitimation struggles. Additionally, we have 
reviewed theoretical developments in Marxist social theory, feminist sociol- 
ogy, and new approaches to social movements and culture that sociologists of 
religion are currently drawing upon to interpret the relationship between 
religion and power. As attention to religious activism and its role in political 
legitimation moves from the margin toward the center of academic concern, 
the sociology of religion offers greater promise than ever to illuminate strug- 
gles over power and privilege, both in the United States and elsewhere. 


Abstract 


There is considerable controversy over the causes of the completed fertility 
transitions that occurred in most industrial countries from 1870 to 1930 and 
the "new" fertility transitions that are currently underway in the developing 
world. New data and empirical analyses of both historical and contemporary 
fertility declines have weakened the standard theory of the demographic tran- 
sition, but none of the plethora of new theories of fertility change have emerged 
as hegemonic or as alternative guides to empirical research. The vast body of 
empirical evidence on the origins, speed, and correlates of fertility declines in 
different historical and geographical settings shows more diversity than a 
simple theory of fertility change would predict. The challenge for the field is 
to develop a common theoretical framework that will accommodate the diver- 
sity of historical paths from high to low fertility. 


INTRODUCTION 


Over the course of the last century, changes in death and birth rates have 
transformed the character of life for virtually every society and family on the 
planet. Decreases in mortality have led in most parts of the world to reasonable 
expectations that parents will see virtually all of their children survive infancy. 
Childbearing has receded from the center stage of family life and from its 
primary role in the lives of adult women to become an option that can be 
scheduled and sequenced with vocational and lifestyle pursuits. Most parents 
can expect to live to see their grandchildren. Although these "new" demo- 
graphic patterns are most common in advanced industrial societies, they are 
on the near-term horizon for most societies around the globe. If human progress 
is to be measured by longevity and reproductive control, the present century, 
and the second half of it in particular, has no historical parallel for the 
advancement of the human condition. 

Many people, social scientists included, assume that these demographic 
revolutions (including those still in process in the developing world) are 
products of the economic and technological changes of the modern era that 
have led to economic development, mass communications, effective programs 
of public health and curative medicine, and related social changes. This basic 
idea—that lowered mortality and lowered fertility, after some lag period, 
follow from socioeconomic development—is widely known as the theory of 
the demographic transition. Demographers, however, are not so sure. Broad 
empirical generalizations and theory construction were perhaps simpler tasks 
in an age with little empirical data. Over the past few decades, intensive 
research on demographic change in historical and contemporary societies has 
revealed complex patterns that do not fit neatly into earlier theoretical schema. 
The fact that fertility transitions in many developing countries are still "in 
process" adds more uncertainty to the search for explanations. 

In recent years, the field of demography has spawned a variety of new ideas, 
conceptual and measurement frameworks, and theories of demographic 
change. The debates in the journals are hot with conflicting claims on every 
issue from questions of measurement and the relative importance of causal 
forces to the ideological bias of researchers and of the entire field (Thomas 
1991, 1993, Cleland 1993). This essay is a critical review of some of the recent 
research and the theoretical debates on fertility transitions in different social, 
economic, and cultural contexts. Research on contemporary fertility trends and 
variations in low fertility settings (the United States and Europe) falls into a 
somewhat different literature and is not directly covered here. 

The review begins with issues of measurement of fertility and fertility 
change. Next, I summarize the recent evidence on fertility levels and trends 
in historical and contemporary populations. The core of the essay consists of 
a comparison of classical and contemporary theories of fertility transitions and 
a discussion of unresolved issues in current research. Although I offer my own 
evaluation of the relative merits of different approaches in the field, I do not 
attempt to provide closure to current debates. In lieu of a conclusion, I suggest 
an alternative model for the field that implies a question very different from 
the one posed in the title of this essay. 


THE MEASUREMENT OF FERTILITY AND FERTILITY 
CHANGE 


Questions of theory and interpretation are closely bound up with the measure- 
ment of the phenomena of interest. Any discussion of current research on 
fertility must be based on a clear understanding of some critical aspects of the 
conceptualization and measurement of fertility. 


Fertility Rates and Population Growth Rates 


It is easy for the nonspecialist to become confused by the variety of 
demographic measures and rates. For example, population growth rates and fertility 
rates are often conflated in the media (and sometimes by scholars). At present, 
population growth rates in many developing countries remain high even though 
fertility rates have declined rapidly. This is because mortality has decreased 
further than fertility, and the age structure of the populations of many devel- 
oping countries has a disproportionate number of persons in the childbearing 
ages (this is a byproduct of high fertility in prior years). The confusion between 
population growth rates and fertility rates often leads to frustration for those 
who just want a simple answer to the question of whether the "population 
problem" is getting worse or getting better. The question needs to be more 
precisely framed. 
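
The arithmetic behind this point can be sketched briefly. The example below 
is only an illustration under stated assumptions: the crude birth and death 
rates are invented figures, not data for any country, the function name is 
hypothetical, and migration is ignored. It treats the annual growth rate as 
the crude birth rate minus the crude death rate, and shows that growth can 
rise while the birth rate falls if the death rate falls faster. 

```python
def natural_increase_percent(births_per_1000, deaths_per_1000):
    """Annual rate of natural increase in percent, ignoring migration."""
    return (births_per_1000 - deaths_per_1000) / 10.0


# Hypothetical crude rates per 1,000 population (illustrative only).
earlier = {"births": 45, "deaths": 25}
later = {"births": 35, "deaths": 10}

growth_earlier = natural_increase_percent(earlier["births"], earlier["deaths"])
growth_later = natural_increase_percent(later["births"], later["deaths"])

# The birth rate is lower in the later period, yet growth is higher.
print(f"earlier growth: {growth_earlier}% per year")
print(f"later growth:   {growth_later}% per year")
```

The crude birth rate used in this sketch also reflects age structure, which 
is one reason it can remain high for a time after fertility per woman has 
begun to decline. 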

For the purposes of this essay, the reader need have only a general 
understanding of one fertility measure and one related concept. The index of fertility 
is the "total fertility rate" (TFR), which is the average number of children born 
to women who survive to age 50 in a population. The additional concept is 
"replacement level fertility," which is a TFR of a little more than two births— 
the reproductive level needed to ensure the replacement of the mother and 
father in the next generation. 
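
A common way to compute a period TFR, assumed here rather than taken from 
the discussion above, is to sum age-specific fertility rates across the 
childbearing ages and multiply by the width of the age groups. The rates 
below are invented for illustration, the names are hypothetical, and the 
value of 2.1 for replacement level is a conventional approximation of "a 
little more than two births" that in practice varies with mortality. 

```python
# Hypothetical age-specific fertility rates: annual births per 1,000 women
# in each five-year age group (invented values, for illustration only).
ASFR_PER_1000 = {
    "15-19": 50,
    "20-24": 200,
    "25-29": 220,
    "30-34": 160,
    "35-39": 100,
    "40-44": 50,
    "45-49": 20,
}

AGE_GROUP_WIDTH = 5      # years a woman spends in each age group
REPLACEMENT_LEVEL = 2.1  # assumed conventional approximation


def total_fertility_rate(rates_per_1000, width=AGE_GROUP_WIDTH):
    """Births per woman implied by living through every age group at these rates."""
    return width * sum(rates_per_1000.values()) / 1000


tfr = total_fertility_rate(ASFR_PER_1000)
above_replacement = tfr > REPLACEMENT_LEVEL
print(f"TFR = {tfr:.1f} births per woman; above replacement: {above_replacement}")
```

Because the calculation applies one set of rates to a hypothetical woman who 
survives the whole childbearing span, it matches the definition given above 
of births to women who survive to age 50. 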


Period and Cohort Perspectives 


Fertility can be viewed as either a life-course phenomenon of women (and 
men) over their reproductive careers or as behavior within a specific interval 
(e.g. a single calendar year). This distinction—between cohort and period 
perspectives (and schemes of measurement)—is a central one for fertility 
analysis. Some measures of fertility can be constructed with either cohort or 
period data (e.g. total fertility rates, although children-ever-born is the more 
common term for a cohort measure), and care must be taken to be sure of the 
basis of measurement and the implications for interpretation. 
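
The difference between the two bases of measurement can be illustrated with 
a small table of age-specific rates. The sketch below is a hedged example, 
not a description of any published method: the rates are invented, fertility 
is assumed to fall steadily from one five-year period to the next, and all 
names are hypothetical. A period measure sums the rates observed in a single 
period, whereas a cohort measure follows one group of women diagonally 
through the table as they age. 

```python
AGE_GROUPS = [15, 20, 25, 30, 35, 40, 45]              # start of each five-year age group
PERIODS = [1950, 1955, 1960, 1965, 1970, 1975, 1980]   # start of each five-year period
BASE_RATES = [50, 200, 220, 160, 100, 50, 20]          # births per 1,000 women per year
SCALE_TENTHS = [10, 9, 8, 7, 6, 5, 4]                  # assumed steady decline by period

# rates[period][age] = hypothetical annual births per 1,000 women.
rates = {
    period: {age: base * scale // 10 for age, base in zip(AGE_GROUPS, BASE_RATES)}
    for period, scale in zip(PERIODS, SCALE_TENTHS)
}


def period_tfr(period):
    """Sum the rates of all age groups observed in one period."""
    return 5 * sum(rates[period].values()) / 1000


def cohort_fertility(first_period):
    """Follow the women aged 15-19 in first_period as they grow older."""
    total = 0
    for step, age in enumerate(AGE_GROUPS):
        total += rates[first_period + 5 * step][age]
    return 5 * total / 1000


print("period 1950:", period_tfr(1950))
print("period 1980:", period_tfr(1980))
print("cohort aged 15-19 in 1950:", cohort_fertility(1950))
```

Under these assumptions the cohort value falls between the two period 
values, because the cohort bears its children across several periods with 
different rates. 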

The conventional demographic wisdom until fairly recently was that cohort 
measures of fertility were superior to period measures because they tap the 
life experiences of a real group that lives through a particular era of history 
together (Hajnal 1947, Ryder 1969, 1983). Ryder (1965) argued that social 
change (not just demographic change) occurs primarily via the behavioral 
patterns of new generations that are exposed to significant historical events at 
the formative stage of their socialization. Much fertility research, however, 
has tended to rely on period measures because of the nature of available data. 
Completed cohort fertility cannot be computed until the end of the childbearing 
span for each generation (until age 45 or 50). The formal interdependence of 
period, cohort, and age has made it difficult, but not impossible, to estimate 
effects of each. 
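
The source of the difficulty is that period equals cohort (year of birth) 
plus age for every observation. The sketch below is a minimal illustration 
under simplifying assumptions: the effects are taken to be purely linear, 
and the slopes, the shift, and the function name are hypothetical. It shows 
that an arbitrary amount can be moved among the three linear slopes without 
changing any predicted value, so data alone cannot separate them. 

```python
def linear_prediction(age, period, cohort, slopes):
    """Predicted outcome from linear age, period, and cohort slopes."""
    age_slope, period_slope, cohort_slope = slopes
    return age_slope * age + period_slope * period + cohort_slope * cohort


# Each record is (age, period, cohort), where cohort is the birth year.
records = [
    (age, year, year - age)
    for year in (1960, 1970, 1980)
    for age in (20, 30, 40)
]

original_slopes = (3, -2, 1)  # hypothetical (age, period, cohort) slopes
shift = 7                     # any other value behaves the same way
shifted_slopes = (
    original_slopes[0] + shift,
    original_slopes[1] - shift,
    original_slopes[2] + shift,
)

# The shift adds shift * (age - period + cohort), which is zero for every record.
same_predictions = all(
    linear_prediction(*record, original_slopes)
    == linear_prediction(*record, shifted_slopes)
    for record in records
)
print(same_predictions)
```

Estimating separate effects therefore requires some additional constraint or 
outside information, which is consistent with the statement that the task is 
difficult but not impossible. 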

In spite of the general theoretical preference for cohort measures, recent 
empirical studies of fertility trends have found that period influences tend to 
be more powerful than cohort influences in explaining variations in fertility 
rates (Foster 1990, Ni Bhrolchain 1992, Raftery et al 1993). In models of 
cyclical patterns of fertility change in advanced societies, the distinction has 
been critical because hypotheses are typically formulated in terms of the 
interplay of cohort and period processes (Easterlin 1976b, 1978a, Butz & Ward 
1979). Much less debate has occurred over the appropriate frame of reference 
for studies of fertility in developing countries, but the greater availability of 
data and new analytical methods will probably shift attention to this issue in 
the coming years. 


The Focus on the Emergence of Fertility Control 


In much of the recent literature on historical and contemporary fertility tran- 
sitions, the primary focus has been on the emergence of intentional control of 
fertility within marriage (in contrast to variations and change in absolute levels 
of fertility). The reasons for this emphasis are found in several influential 
studies and their links to a particular theoretical interpretation of fertility 
change. Because I regard this emphasis as unduly narrow, I want briefly to 
review the conceptual and methodological assumptions of these classic studies 
and their subsequent imprint on the development of fertility research. First, a 
digression on potential and actual fertility is necessary to set the stage for the 
discussion of the concepts of "natural fertility" and "controlled fertility." 

One of the nonintuitive features of human reproduction is that high (seem- 
ingly uncontrolled) levels of fertility are actually well below what would be 
unrestrained fertility (Bongaarts 1975). High-fertility societies rarely exceed 
an average of eight or nine live births per woman, which is well below the 
biological maximum that could be achieved in the absence of social and 
cultural norms that regulate marriage patterns, frequency of intercourse, and 
length of breastfeeding in noncontracepting societies. The classic articles on 
"intermediate variables" by Davis & Blake (1956) and on "proximate variables" 
by Bongaarts (1978, Bongaarts & Potter 1983) provide conceptual and 
analytical frameworks for the incorporation of these sociobiological factors as 
intervening variables in the fertility determination process. These conceptual 
frameworks, however, do not explain the origins of the social and cultural 
influences that constrain fertility in high-fertility settings. 

In one of the most influential articles in modern demography, Henry (1961) 
presented the concept of "natural fertility" to characterize fertility in societies 
where married couples do not consciously try to limit the number of children 
they have. Variations in natural fertility can range by a factor of two or more 
(e.g. from a TFR of 4 or 5 to more than 8). Henry defined "fertility control" 
(breaking with natural fertility) as parity-specific behavior to restrict fertility— 
"when the number (of births) reaches the maximum that the couple does not 
want to exceed" (1961:81). Transitions from natural fertility to controlled 
marital fertility are critical historical moments, according to Henry, that began 
the shift from the regulation of births by traditional social customs to the low 
levels of reproduction that are typical of modern societies. 

Henry's definition of natural fertility as the lack of parity-specific control 
led to the development of a variety of ingenious indirect methods to measure 
intentions (conscious efforts to limit fertility) with only standard demographic 
data. These innovative methods include the inspection of the shape of age-spe- 
cific fertility curves (Knodel 1977) and the famous "M & m" indicators 
developed by Coale & Trussell (1974, 1978) as deviations from an empirically 
observed set of natural fertility patterns. Gradually, these new methods, and 
the assumptions behind them, led to a focus on explaining the emergence of 
fertility control as the primary question in the field—and a lack of interest in 
explaining the wide variations in fertility among "natural fertility" populations. 
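
A rough sketch of how the "M & m" indicators work may help. In the Coale & Trussell model, observed marital fertility at age a is written r(a) = M * n(a) * exp(m * v(a)), where n(a) is a standard natural fertility schedule and v(a) is a standard schedule of departures from it. Taking logs gives a straight line, ln(r(a)/n(a)) = ln(M) + m * v(a), so M and m can be estimated by simple regression. The function name below is hypothetical, and the n(a) and v(a) values are placeholders chosen only to have a plausible shape; they are not the published standard schedules.

```python
import math


def fit_M_m(observed, natural, departure):
    """Least-squares fit of ln(r/n) = ln(M) + m * v across age groups.

    observed  -- observed marital fertility rates r(a)
    natural   -- standard natural fertility schedule n(a)
    departure -- standard departure schedule v(a), zero or negative
    Returns (M, m): M scales the level, m measures the degree of
    age-patterned departure from natural fertility.
    """
    ys = [math.log(r / n) for r, n in zip(observed, natural)]
    xs = list(departure)
    k = len(xs)
    x_bar = sum(xs) / k
    y_bar = sum(ys) / k
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    m = sxy / sxx
    M = math.exp(y_bar - m * x_bar)
    return M, m


# Placeholder schedules for ages 20-24 through 45-49 (illustrative only).
N_PLACEHOLDER = [0.46, 0.43, 0.40, 0.32, 0.17, 0.02]
V_PLACEHOLDER = [0.0, -0.3, -0.7, -1.0, -1.4, -1.7]

# Synthetic "observed" rates built from known parameters, to check the fit.
TRUE_M, TRUE_m = 0.9, 0.5
synthetic_rates = [
    TRUE_M * n * math.exp(TRUE_m * v)
    for n, v in zip(N_PLACEHOLDER, V_PLACEHOLDER)
]
M_hat, m_hat = fit_M_m(synthetic_rates, N_PLACEHOLDER, V_PLACEHOLDER)
```

Because v(a) grows more negative with age, a larger m means fertility falls off faster at older ages than the natural schedule would predict, which is read as indirect evidence of parity-specific stopping. The sketch also makes the limitation visible: m responds only to the age pattern, so control exercised through spacing would not necessarily register.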

This focus on the emergence of conscious fertility control is illustrated in 
the analysis and the conclusions of the Princeton European Fertility Project 
(Coale & Watkins 1986). Even when one could not show direct measures of 
“parity-specific control,” patterns of sustained declines in marital fertility were 
interpreted as the product of conscious planning by couples to limit their 
fertility after reaching their desired family size. This interpretation may well 
be correct, but its attractiveness was enhanced by the fit with the “new” 
explanation of the European fertility transition from 1870 to 1930 as a product 
of the diffusion of ideas and knowledge of the means of fertility control. 

There are several problems with the exclusive focus on the transition from 
natural to controlled fertility as the central empirical question in the field. Most 
basic is the simple measurement question of whether fertility control is accu- 
rately measured by indirect measures. Guinnane et al (1992) show that changes 
in aggregate fertility (Ig) and "little m" may be poor indicators of the origins 
of fertility control (defined as stopping behavior). Moreover, fertility control 
could be based on “spacing” rather than “stopping” and thus present a defini- 
tional problem for the distinction between natural and controlled fertility 
(Knodel 1983). Indeed, there is considerable evidence that lengthening of birth 
intervals was an important component of historical fertility transitions (Bean 
et al 1990:207) and contemporary fertility declines (Caldwell et al 1992). 

Another problem is that variations in natural fertility (that are not subject 
to conscious planning) are deemed to be theoretically unimportant. Although 
the rapid pace of decline in marital fertility that eventuates in very low fertility 
may require conscious use of birth control, the variety of social customs that 
regulate marriage behavior, divorce, widow remarriage, spousal separation, 
length of breastfeeding, and coital frequency may well reflect the impact of 
economic and environmental influences on reproduction. In other words, con- 
sciousness may be an important intervening variable, but it is not an indis- 
pensable ingredient in the story of how fertility is socially regulated. 

The other dilemma of the "natural fertility-to-controlled fertility" scenario 
is the need to show that pretransition societies did not know how to regulate 
marital fertility (Knodel & van de Walle 1979). If the means of fertility 
regulation were widely known to married couples in pretransition populations, 
it is difficult to maintain the assumption of natural fertility in all pretransition 
populations. The evidence is, however, mixed, and several well-chosen exam- 
ples do not establish the pattern for all pretransition societies. Counter-exam- 
ples of control of marital fertility in pretransition populations can also be cited 
(Wrigley 1966, Andorka 1982). Blake (1985) argues that there is more conti- 
nuity than divergence in the practice of marital fertility control among pre- 
and post-transition societies. 

Van de Walle (1992) has argued that pretransition societies do not have a 
clear concept of desired family size and that there is a lack of conscious 
thinking about family size. This may be true in many instances, but it does 
not mean that fertility behavior was not "regulated" in many pretransition 
societies. My conclusion is simply that the focus on the emergence of fertility 
control is both too narrow (much significant fertility variation is ignored) 
and unnecessary: individuals and populations can reduce fertility without 
consciously planning a desired family size. The distinction between 
natural and controlled fertility may be an important element in the process of 
modern fertility transitions, but the current emphasis seems disproportionate 
to its value. 


HISTORICAL AND CONTEMPORARY TRENDS IN 
HUMAN FERTILITY 


The assumption of constant high fertility in pretransition societies is widely 
held, but there is considerable evidence of systematic variation. One of the 
most significant findings of recent research is that fertility is higher in intensive 
agricultural societies than in forager (hunting and gathering) and horticultural 
(swidden agricultural) societies (Gillian et al 1993). The reasons for this 
difference (and the behavioral mechanisms) are unclear, but possible explana- 
tions could be related to higher mortality in agrarian societies, problems of 
caring for too many infants and small children in migratory populations, and 
earlier weaning of infants in settled agricultural populations. Anthony Reid 
(1988:158, 162, 1992:461) speculated that the spread of world religions (Islam 
and Christianity) in settled agricultural populations in Southeast Asia led to a 
rise in fertility relative to that of migratory populations with traditional animist 
beliefs. He reasoned that traditional belief systems did not forbid premarital 
sexuality, which coexisted with endemic gonorrhea and resulted in high levels of 
sterility. 

There were systematic oscillations in fertility in preindustrial Europe in 
response to changes in economic conditions, primarily through the mechanism 
of marriage (Wrigley 1969, Galloway 1988, Wilson & Woods 1991). The 
European (actually Western European) marriage pattern, which Coale (1973) 
labels the "Malthusian transition," emerged sometime during the Middle Ages. 
The European marriage system had two components: a high average age at 
marriage and a high proportion of people who never married. 
These patterns varied widely within and between populations, and they rose 
and fell in response to economic conditions (Goldstone 1986, Hajnal 1965). 
At times of economic crisis, the average age of marriage could rise to 30 
(Wrigley 1966) and the proportion of a cohort never marrying to above 20% 
(Goldstone 1986). One major consequence of the European marriage pattern 
was that pretransition fertility in Europe was at moderate levels relative to 
"high fertility" levels elsewhere. 

Traditional levels of fertility in non-European societies were higher (approximately 
6-8 births per woman compared to 4-5 births per woman in many 
pretransition European populations), but they were also socially regulated by 
"intermediate variables" other than age at marriage and the proportion ever-married, 
for instance, spousal separation, breastfeeding, sexual abstinence, 
nonmarriage after widowhood, and so forth. In an interesting illustration of 
the differing mechanisms of fertility regulation, Kumar (1971) showed that 
marital fertility was higher in nineteenth-century Sweden and Finland than in 
twentieth-century India. It is unclear if fertility fluctuated in response to 
economic conditions (wages, harvests) in premodern non-European societies 
as it did (via marriage) in Europe. 

The first “modern” fertility transitions began in early nineteenth-century 
France and the United States (Coale & Treadway 1986:37, Tolnay & Guest 
1984, Sanderson 1987, Haines 1989, Bean et al 1990, Gutmann & Fliess 1993). 
The rest of Europe followed suit, about a half-century later, in the last three 
decades of the nineteenth century. These trends, which consisted almost en- 
tirely of declines in marital fertility, were largely completed by the 1930s. The 
patterns are extensively documented in the many publications of the Princeton 
European Fertility Project (summarized by Coale & Watkins 1986). If the onset 
of a marital fertility transition is marked by a decline of 10% from the pretransition 
plateau, the descent to about 50% of the pretransition level was 
largely completed in 30 years (Coale & Treadway 1986:40-41). In contrast to 
the fluctuations in fertility in earlier centuries, these modern fertility transitions 
were permanent reductions. 
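
The dating rule just described (onset at a 10% decline from the pretransition plateau, followed by a descent to about half the plateau) can be sketched as a simple scan over a time series. This is a minimal illustration under stated assumptions: the function name, the series, and the plateau value are hypothetical, and the sketch does not check that the decline is sustained, which any real application would need to do given the fluctuations seen in earlier centuries.

```python
def transition_milestones(series, plateau):
    """Locate the onset and halfway point of a marital fertility decline.

    series  -- list of (year, marital_fertility_index) in chronological order
    plateau -- the pretransition level of the index
    Returns (onset_year, halfway_year); either is None if not reached.
    Onset is the first year at or below 90% of the plateau; the halfway
    point is the first later (or same) year at or below 50% of the plateau.
    """
    onset = None
    halfway = None
    for year, value in series:
        if onset is None and value <= 0.9 * plateau:
            onset = year
        if onset is not None and halfway is None and value <= 0.5 * plateau:
            halfway = year
    return onset, halfway


# Hypothetical index values for one population (illustrative only).
example_series = [
    (1860, 0.70), (1870, 0.69), (1880, 0.62), (1890, 0.55),
    (1900, 0.45), (1910, 0.34), (1920, 0.30),
]
onset_year, halfway_year = transition_milestones(example_series, plateau=0.70)
```

For this made-up series the onset falls in 1880 and the halfway point in 1910, a span of 30 years, which matches the order of magnitude reported in the text.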

There were "baby booms" with a modest resurgence in fertility in some 
Western societies during the 1950s (the United States, in particular), but these 
were temporary and did not represent a return to pretransition fertility levels. 
Over the last 20 years, fertility in most industrial countries has remained at 
the lowest levels ever recorded, well below the replacement level (Davis et 
al 1986). Japan, the most modern non-Western society, followed the European 
pattern fairly well. Japan's modern demographic transition began in the late 
nineteenth century but was interrupted by World War II and its immediate 
aftermath (Mosk 1979, Hanley 1979). The rapid decline to replacement-level 
fertility in Japan in the 1950s was a continuation of trends firmly established 
earlier in the century. 

In the 1960s, fertility began to decline in some developing countries. Ini- 
tially, these declines were limited to a small number of rapidly modernizing 
societies in East Asia, and it was unclear whether these trends would spread 
to the larger and poorer countries in the Third World. In the last decade modern 
fertility transitions have begun in most countries in East Asia, Southeast Asia, 
South Asia, and Latin America (McNicoll 1992, Palloni 1990, Feeney et al 
1989, Robey et al 1992, Freedman & Blanc 1992, Leete & Alam 1993, 
Hirschman & Guest 1990). There are still many areas with high fertility in 
South Asia and sub-Saharan Africa, but recent evidence shows that fertility is 
beginning to decline in many of these countries as well (Menken & Phillips 
1990, Caldwell et al 1988, 1992, Robinson 1992, Brass & Jolly 1993, Ruten- 
berg & Diamond 1993). 

There has been some concern that the rate of fertility decline has slowed 
down in some developing countries in the last decade—stalling at a plateau 
of moderate fertility in the range of three or four births per woman (Bongaarts 
1987, Horiuchi 1992). Freedman & Blanc conclude, however, that the slow- 
down is largely concentrated in East Asia where levels are reaching the re- 
placement floor (1992:45). There is evidence of below-replacement fertility 
emerging in a number of developing countries including Taiwan, South Korea, 
Singapore, and Thailand (Rele & Alam 1993, Hirschman et al 1994). 

At present, a crude threefold division of the world's fertility regimes might 
be: (i) industrial societies that have experienced long-term fertility declines 
beginning in the nineteenth or early twentieth century and that currently have 
fertility at or below the replacement level; (ii) developing societies that have 
experienced significant fertility declines over the last 10 to 25 years and where 
current total fertility rates are between 2.5 and 4 births per woman; and (iii) 
less developed countries that have yet to experience significant fertility reductions 
and where average childbearing levels exceed 5 births per woman. 
As soon as such a typology is presented, there are qualifications about the 
homogeneity of the categories and the placement of particular countries or 
regions in these categories. A number of developing countries have entered 
(or shortly will enter) the first category of below-replacement level fertility. 
And more and more countries are moving from the third to the second category. 
Any cross-sectional portrait of a rapidly changing historical process is out of 
date as soon as it is published and is likely to be a poor image of the future. 
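
The threefold division above can be written as a small classifier, which also makes its gaps explicit. This is only a sketch: it uses the level of the total fertility rate alone, whereas the typology in the text also depends on the history of decline, and the replacement value of 2.1 is an assumed conventional figure that the text does not state. The function and constant names are hypothetical.

```python
# Assumed conventional replacement level; the text gives no number.
REPLACEMENT_TFR = 2.1


def fertility_regime(tfr):
    """Assign a total fertility rate to one of the three crude categories.

    Returns "unclassified" for levels that fall between the stated bands,
    which is where the qualifications about category boundaries arise.
    """
    if tfr <= REPLACEMENT_TFR:
        return "i: at or below replacement"
    if 2.5 <= tfr <= 4.0:
        return "ii: recent significant decline"
    if tfr > 5.0:
        return "iii: little or no decline"
    return "unclassified"
```

Under these assumptions a TFR of 1.8 falls in category (i), 3.2 in category (ii), and 6.5 in category (iii), while a TFR of 4.5 or 2.3 lands in neither neighboring band. Countries passing through those intermediate levels are exactly the cases that make any cross-sectional typology go out of date quickly.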


THEORIES AND MODELS OF FERTILITY CHANGE 


Interpretations of modern fertility declines ranged widely in the late nineteenth 
and early twentieth centuries, mixing social explanations with speculations 
about the impact of industrial life on biological capacities to reproduce and 
on declining sexual motivations. Demographers, most notably Warren Thomp- 
son (1929), emphasized the social and economic forces of modern societies 
as the basic causes of lowered fertility. These ideas crystallized over the second 
quarter of the twentieth century and emerged in the 1940s as Demographic 
Transition Theory (DTT), largely under the pen of Frank Notestein, as a 
synthesis of ideas and empirical correlations based on the historical experiences 
of Western Europe and North America (Notestein 1945, 1953). 


Demographic Transition Theory 


Notestein emphasized the changing institutional fabric of urban industrial 
society that led to the “emergence of a new ideal in matters of family size” 
(1953:16). Among the motivating factors of modern society were: “reduced 
... pressures toward traditional behavior,” “education and a rational point of 
view,” “the cost of child-rearing grew and ... economic contributions by 
children declined,” and that “women ... found new independence from house- 
hold obligations and new economic roles less compatible with child-rearing” 
(Notestein 1953:16). The means for fertility control was contraceptive use by 
married couples—which had “been widely used for centuries throughout the 
world,” but was “not widely used until the incentive for birth restriction became 
strong” in industrial society (Notestein 1953:16-17). 

The corpus of transition theory was very broad. Within the same theoretical 
tent, Kingsley Davis could minimize the role of religious and cultural values 
as primary determinants of fertility while Ronald Freedman was pointing to 
the central role of norms for family size (Davis 1955, Freedman 1963). Critics 
of the 1980s and 1990s who charge demographic transition theory with being 
narrowly economistic or with ignoring the role of ideas have missed the central 
point: that transition theory had room for every causal variable. 

The two modern seminal contributions in the development of demographic 
transition theory (and the most widely cited) are Kingsley Davis's "Theory of 
Change and Response in Modern Demographic History" (Davis 1963) and 
Ansley Coale's 1973 essay on the demographic transition, which reflected 
many of the findings from the Princeton European Fertility Project (Coale 
1973). It is hard to imagine two articles more different, yet the field has 
accepted both as part of a common theoretical stream and not as contradictory 
theses. 

Davis (1963) put the conscious use of contraception by married couples as 
only one of many possible responses in his "multiphasic theory" of demographic 
change. The most important independent variable is the level of household 
economic strain, which is a function of household size and potential 
economic resources. High levels of household economic strain were the mo- 
tivating factor that caused individuals (and families) to postpone marriage, 
never marry, migrate, use abortion, practice infanticide, and restrict marital 
fertility by contraception. All of these were mechanisms to maintain (or im- 
prove) economic welfare. Davis argues that most societies will use all of these 
methods (in varying proportions) in response to the population pressure (man- 
ifest as economic strain) caused by mortality declines and the economic op- 
portunities that accompany modernization. Absolute poverty does not provide 
a context for demographic responses, according to Davis; it is the possibility 
of betterment (or of holding onto recent improvements) during the moderniza- 
tion process that provides the central motivation. 

Although Davis's article is widely cited in the demographic transition liter- 
ature, the body of research that tests hypotheses from his theory is relatively 
modest (Friedlander 1969, 1983, Mosher 1980a, 1980b). One problem of 
testing Davis's theory is the identification and measurement of economic strain 
that is a function of potential household size and economic aspirations. If 
economic welfare rises faster than aspirations, there would not be any pressure 
for changes in demographic behaviors. Modernization may have direct impacts 
on the motivation for fertility (and other demographic behaviors) indepen- 
dently of changes in economic strain caused by declining mortality. 

In contrast to Davis's broadening of the range of demographic responses, 
Coale (1973) focused on the reasons for declines in marital fertility alone. He 
identified three necessary conditions for the decline of marital fertility: (i) a 
setting that allowed for fertility planning to be part of the calculus of conscious 
choice, (ii) the availability of effective information about the means to control 
fertility, and (iii) clear economic advantages of fertility control. The third 
condition is the standard link to the traditional thesis of demographic transition 
theory (DTT) that socioeconomic factors change the incentives for childbear- 
ing. The first two conditions are ones that demographers had pretty much taken 
for granted—that societies adjusted demographic behavior when circumstances 
warranted and that some knowledge about fertility control was available in 
most historical societies. 

The three preconditions in Coale's framework fit well with Henry's (1961) 
thesis that natural fertility was practiced in pretransition societies. Because the 
last condition was already well known, most subsequent attention has been 
focused on the observation that natural-fertility societies lacked legitimating 
cultural values for fertility control and information about the means to control 
fertility (although Coale acknowledges that folk methods of contraception were 
used in some pretransition populations; see Coale 1979:15). The widely cited 
evidence that many women in traditional societies cannot answer questions 
about desired family size (lacking numeracy about family size; see van de 
Walle 1992) illustrated the cultural roots of high fertility. Coale's formulation 
represented a shift away from the central theme of Demographic Transition 
Theory, that changes in socioeconomic institutions are the primary precursors 
to fertility decline. The initial challenge to DTT was, however, not theoretical 
but empirical. Without disconfirming evidence, Coale's categories would prob- 
ably have found their place under the broad umbrella of DTT, where many 
variant ideas were often expressed with little internal conflict. 

Beginning in the 1970s and throughout the 1980s, empirical studies of 
fertility of both historical and contemporary societies reported findings that 
were at odds with the expected associations of socioeconomic variables and 
fertility. The most famous was an article by Knodel & van de Walle (1979) 
that summarized the findings of the Princeton European Fertility Project with 
a discussion of the implications for future fertility transitions in less developed 
countries (the article was later published as chapter 10 in Coale & Watkins 
1986). Knodel & van de Walle reported that fertility declines began about the 
same time in a number of European countries that were at quite different levels 
of socioeconomic development. They emphasized cultural setting and diffusion 
as the critical elements to explain the spread of fertility control in Europe. In 
subsequent summaries of the Princeton European Fertility Project (EFP), Wat- 
kins (1986, 1987) also concluded that the results disconfirmed the empirical 
predictions of standard demographic transition theory. 

About the same time, results on fertility change from the contemporary 
less-developed countries were being published from the World Fertility Survey 
(WFS) program (Cleland & Hobcraft 1985, Cleland & Scott 1987). Cleland 
summarized the results of the comparative WFS analyses of marital fertility 
as representing a clear refutation of the “demand” (demographic transition 
style) theories (Cleland 1985, Cleland & Wilson 1987). Few of the expected 
associations between socioeconomic variables and fertility (with the exception 
of female education) predicted by DTT were consistently found in the com- 
parative analyses of the WFS. 

This double whammy (from the EFP and the WFS) created a crisis for 
demographic transition theory. It is now the conventional wisdom in many 
circles, inside the field of demography and beyond, that demographic transition 
theory is near death. In the overview theoretical chapter in a book of historical 
analyses of European fertility declines, Alter states that “In the last 20 years, 
the field’s dominant consensus, the theory of the demographic transition, has 
been dramatically shattered” (1992:13). Theories, however, rarely pass from 
the scene unless there is a superior alternative that provides a clearer account 
of research in the field. The dilemma is that there is no consensus on an 
alternative theory to replace demographic transition theory. A theoretical vac- 
uum is not tolerable for an empirical science with a continuous need for testable 
hypotheses. So the debates continue with a plethora of contending theoretical 
frameworks, none of which has gained wide adherence. 


Caldwell's Theory of Intergenerational Wealth Flows 


In a series of influential articles and books, Jack Caldwell (1976, 1980, 1982, 
Caldwell et al 1988) criticized the economistic biases of traditional demo- 
graphic transition theory and offered his revisionist theory of intergenerational 
wealth flows. In Caldwell's theory there are only two stable fertility regimes— 
traditional societies where fertility is as high as possible and modern societies 
where childbearing is low. In traditional societies, children provide a positive 
net flow of resources, services, and status-honor up the generational ladder to 
parents, especially to the patriarch. In stable high-fertility societies, there is 
always an incentive for additional children. In modern societies, there is a 
reversal of the wealth flow, and parents contribute time, money, services, and 
support downward to children with minimal expectations of any return. Ac- 
cording to Caldwell, there are few economic incentives for fertility in modern 
child-centered societies. 

The reasons for the destabilization of high-fertility regimes are not a simple 
matter in Caldwell's theoretical writings. In some places, he emphasizes the 
diffusion of Western cultural models of the nuclear child-centered family via 
international communications and the mass media. In other contexts, Caldwell 
has stressed the impact of “mass schooling” in a community, which tips the 
balance to lowered fertility. His theory has been interpreted as supporting 
aspects of both structural and cultural theories. 

The principal obstacle for many scholars in the field has been the lack of a 
clear model for conducting empirical tests of hypotheses from Caldwell's 
theory. Caldwell's insistence that demography must shift its focus to more 
ethnographic investigations (“micro” approaches) has not clarified the task. 
Measuring intergenerational wealth flows (or perceptions of the flows) over 
historical time is a difficult challenge. The methodological problems include 
the measurement of the nonmonetary components of wealth flows, such as 
services, deference, emotional gratification, and undefined obligations. Beyond 
Caldwell's own research, there are relatively few empirical studies of the 
intergenerational wealth flow hypothesis. 


Economic Theories of Fertility 


There are two major economic approaches to the study of fertility change and 
many variants (Sanderson 1976). The first is the "new home economics," 
which is the application of microeconomic theory to family issues, including 
fertility (Becker 1960, 1988, Schultz 1981). The second is the synthesis of 
economic and sociological theories of fertility presented in the supply and 
demand framework of Richard Easterlin. 

The first applications of microeconomic theory to fertility behavior were 
rather mechanistic illustrations of consumer choice theory with little acknowl- 
edgment of the significant differences between the acquisition of an automobile 
and a baby. These early applications of economic theory to fertility inspired 
Judith Blake's devastating critique (Blake 1968, also see Turchi 1975). Over 
the years, however, economists have become more sophisticated in their ap- 
plication of economic theory to household behavior, and they have also become 
more rigorous in the measurement of economic behavior in their empirical 
analyses. Ideas and hypotheses have also gradually drifted across disciplinary 
lines. Demographic research that includes the opportunity costs of women's 
time as well as the impact of prices and incomes on demographic behavior 
is now much more common in the literature. There is some evidence in support 
of economic hypotheses (Hutaserani & Roumasset 1991), but the approach 
remains too narrow to be a significant theoretical challenge to demographic 
transition theory. As Robinson (1992:453) notes, "the proposition (microeco- 
nomic theory of fertility) has not been proven, only asserted often enough to 
gain a certain credibility and force through repetition." 

In several essays and a major book, Richard Easterlin has made a serious 
effort to join economic theories with more traditional sociological research on 
fertility (Easterlin 1969, 1978b, 1983, Easterlin & Crimmins 1985). Two 
volumes of state-of-the-art essays on fertility in developing countries, spon- 
sored by the National Academy of Sciences, were organized around Easterlin's 
framework of the three basic determinants of fertility: demand, supply, and 
the costs of fertility regulation (Bulatao & Lee 1983). Demand factors include 
the standard socioeconomic determinants of fertility from modernization (de- 
mographic transition) theory. Supply factors are environmental and cultural 
factors that constrain natural fertility. The costs of fertility regulation include 
the monetary, time, and psychic factors associated with the use of contracep- 
tion. In a series of ingenious graphs, Easterlin shows how modernization can 
lead first to a rise and then to a fall in fertility as the forces of supply, demand, 
and regulation costs combine to shape fertility behavior (1983:566-574). 

The inclusiveness of Easterlin's model has clarified some empirical anom- 
alies in the study of fertility trends. Nevertheless, his model shares two limi- 
tations of prior work in the literature. First, the assumption of natural fertility 
means that the wide variations in pretransition marital fertility (and marital 
behavior) are outside the scope of the theory. Second, there is no effort to 
resolve the theoretical and empirical problems in the specification of what 
socioeconomic variables account for demand. In general, demand for fertility 
does decline with modernization, but the question of the many weak associations 
between the standard predictor variables and fertility remains unresolved. 


Ideational Theory 


Within the dominant branches of American demography, there has been per- 
sistent skepticism that values, attitudes, and other psychological orientations 
can explain fertility trends or variations between populations. It is, therefore, 
perhaps appropriate that the new interpretation of fertility linked to deep 
cultural variables, known as ideational theory, was developed primarily by 
European demographers, especially Ron Lesthaeghe (Lesthaeghe 1980, 1983, 
Lesthaeghe & Surkyn 1988). Much of the impetus for ideational theory has 
been the alleged failure of demand (socioeconomic) theories (Cleland 1985, 
Cleland & Wilson 1987). Culture spans a wide variety of phenomena, and 
there are quite varied meanings of the term in the demographic literature 
(Hammel 1990, Pollak & Watkins 1993). This gives rise to not one, but several 
theories of culture and fertility. 

The strong cultural hypothesis is that groups differ in fertility behavior 
because of cultural values. Some populations may have higher levels of fertility 
than other groups with equivalent socioeconomic characteristics because their 
culture places a higher value on children or proscribes certain methods of 
fertility control. Lesthaeghe & Surkyn (1988) argue that historical variations 
in European fertility are closely related to national differences in religious 
beliefs, individualism, and secularism. Even if this argument is accepted, the 
question of the origins of cultural values remains unresolved (Preston 1986: 
186-189). Davis (1963) dismissed all cultural theories because of the inherent
circularity of the basic logic, that is, behavioral patterns are explained in terms 
of cultural preferences for that behavior. To avoid this fault, cultural variables 
need to be explained in terms of structural conditions or historical experiences. 

Traditional values are typically rooted in rural environments and among 
recent migrants to urban areas. Cultural values, however, may persist long 
after the structural conditions in which they originated have eroded. Lesthaeghe 
& Surkyn (1988) argue that the rise in fertility after World War II (baby boom) 
may be explained by the orientations of generations that were reared in tradi- 
tional prewar environments (the very low prewar fertility was a product of 
economic constraints). According to this argument, the change in values after 
World War II toward greater materialism and individualism led to the lowered 
fertility in the 1960s and 1970s. 

A weaker version of the cultural interpretation posits that ideas about ap- 
propriate family size and methods of birth control can diffuse more quickly 
within culturally homogeneous populations (Retherford 1979, Retherford & 
Palmore 1983). The critical assumption is that the lag period between structural 
changes and demographic responses can be shortened or lengthened by intragroup
communication about fertility ideals, the legitimacy of fertility control,
and techniques of birth control and abortion. Interpretations based on
studies of the European demographic transition suggest that the cultural dif- 
fusion of ideas (and knowledge about the practice) of birth control can even 
precede the structural changes in society. The result is that patterns of fertility 
decline are more likely to mirror geographic maps of ethnic groups than of 
socioeconomic change (Watkins 1986, 1987). 

The empirical base for ideational theory seems to rest more on a negative 
case for the alternative model—the rejection of socioeconomic explanations— 
than on positive evidence (Cleland & Wilson 1987). The claim that fertility 
declined at about the same time in a variety of socioeconomic settings does 
not mean that culture is the prime mover (Mason 1992). It is clear that diffusion 
is an important process in the explanation of fertility, but the links between 
culture and diffusion have yet to be clearly articulated and empirically tested. 


UNRESOLVED ISSUES IN RESEARCH ON FERTILITY 


The absence of a hegemonic theory of fertility does not mean that knowledge
is lacking; there are many empirical generalizations about the determinants
of fertility in various settings. The existing base of knowledge,
however, cannot be summed up in one or two global statements. In this section, 
I review some of the major issues that require further empirical understanding
(and perhaps some new insights) before it will be possible to frame a unified 
theory of fertility transitions. 


Marriage, Family Structure, and Fertility 


In most contexts, human fertility occurs in family units. In fact, the standard
definition and functional explanation of familial institutions are invariably
presented in terms of childbearing and childrearing. Although there are close
connections between family structure and fertility, the relationship is not
simple. Prior research has examined many strands of the relationship, including
the timing and prevalence of marriage, the complexity of household structure,
residence patterns after marriage, and inheritance patterns. There seem to be
few universal relationships.

The idea is sometimes expressed that age at marriage does not necessarily 
have a strong influence on fertility because there is generally sufficient time 
to "make up" for any delay in childbearing following marital postponement.
Although this is logically true for individuals, aggregate patterns almost in- 
variably show that later marriage means lower overall fertility (Smith 1983). 
Indeed, the principal mechanisms of fertility regulation in premodern Europe 
were variations in marriage timing and the prevalence of marriage (Coale & 
Treadway 1986:47-48). The partial breakdown of the system of marital
postponement is thought to have contributed to a rise in fertility in the early stages
of modernization (Tilly 1978, Goldstone 1986). 

If marriage was the traditional mechanism of fertility control in Europe, and 
the modern European fertility transitions from 1870 to 1930 were largely 
confined to declines in marital fertility, then a reasonable hypothesis might be 
that changes in marriage patterns do not covary with changes in marital 
fertility. Indeed, this logic has been implicit in much of the historical demog- 
raphy literature. Therefore, Coale's (1992) recent paper which shows a close 
association of delayed marriage and the advent of fertility control within 
marriage (for both historical and contemporary populations) raises a number 
of significant questions. Coale reasons that cultural contexts that lead to later 
marriage are also favorable for female autonomy and control over reproduc- 
tion. The finding is also consistent with Davis's (1963) multiphasic theory of 
demographic response.

In many developing countries, there have been trends toward delayed mar- 
riage (Smith 1980) and also declines in marital fertility. Although the largest 
component of Asian fertility declines has been declines in marital fertility 
(Retherford & Cho 1973, Hirschman & Guest 1990), changes in marriage 
patterns have also been significant in some cases. The negative impact of 
marriage timing on fertility is confounded, however, by the fact that marital 
postponement often leads to a shorter first birth interval (Hirschman 1985). 
This apparent anomaly has been explained by Rindfuss & Morgan (1983) as 
a result of a trend toward romantic marriages and increased coital frequency 
early in marriage. The evidence is largely indirect, but the strength of the logic 
and replication of the basic findings support Rindfuss & Morgan’s interpreta- 
tion (Fricke & Teachman 1993). Although the trend toward marital postpone- 
ment may lead to shorter birth intervals in the short run, the long-run expec- 
tation is lowered marital fertility. 

Another emerging pattern is a lowered prevalence of marriage in societies 
that formerly had universal marriage. Recent data show double-digit percent- 
ages of women in their early thirties who have not yet married in several Asian 
countries (Lim et al 1987, Xenos & Gultiano 1992, Guest & Tan 1993). This 
increase in the numbers of never-married will have significant implications 
for future levels of fertility and other social patterns. 

The classic interpretation of the impact of family structure on fertility was 
presented almost 40 years ago by Kingsley Davis (1955). He noted the strong 
association of extended family households with higher fertility in less devel- 
oped areas and argued that the extended family structure provided extensive 
supports for early marriage and childbearing. Marriage did not have to be 
postponed until an independent household could be set up. The costs of chil- 
dren and the burden of child care are subsidized by other members of the 
household, not borne exclusively by the parents. In extended family households,
the young married couple, especially the bride, acquires recognition and
status with the production of children. This thesis conforms to the traditional 
contrast of the nuclear family system in many industrial countries with the 
traditional pattern of young and universal marriage in other parts of the world 
(Hajnal 1965, Dixon 1971). But there may be more flexibility between family 
structure and fertility in modern industrializing societies. 

Ronald Freedman and his colleagues have been monitoring changes in the 
structure of Chinese families and fertility in Taiwan for several decades (for 
the latest report, see Weinstein et al 1990). Initially it was assumed that there 
would be a close relationship between extended family living arrangements 
and fertility—both declining with modernization. The Taiwanese fertility tran- 
sition is now complete with a below-replacement fertility rate, but co-residence 
of married couples with husband’s parents remains an important aspect of 
family structure in Taiwan. About one half of adults live in extended family 
units, and most couples live with the husband’s parents for at least some time 
after marriage. There has been some decrease in joint family living arrange- 
ments in Taiwan, but the continuity is most impressive. 

The persistence of traditional Chinese family living arrangements in Taiwan, 
however, has not meant that fertility has remained high. It may be that tradi- 
tional family structures in traditional societies result in high fertility, but 
traditional family forms can accommodate new content, including lowered 
fertility, in modernizing societies. Axinn (1992) reports that exposure to ex- 
trafamilial activities by husbands and wives leads to increased contraceptive 
use in Nepalese society. Family structures in developing countries are changing 
rapidly, as they are in industrial countries, and future patterns are unlikely to 
fit into neat categories of traditional and modern structures (Thornton & Fricke
1987). Enormous methodological problems confront research on the dynamic 
relationships between family structure and reproductive behavior (Burch 
1983). Further progress may require longitudinal studies that track family 
interactions and exchanges across generations that are knit more by economic 
and social obligations than by common residence. 


Modernization and Fertility 


The concept of modernization has undergone even more trials than the theory 
of the demographic transition. Even if synonyms (socioeconomic change, 
development) can be substituted as temporary remedies, the conceptual crisis 
remains—we do not have a precise account of the social, economic, and 
cultural forces that are necessary or sufficient conditions to transform low 
income, rural agrarian societies into high income, urban industrial ones. As
with demographic theory, the search for a grand theory of modernization has 
often given way to a search for empirical patterns that may vary with specific 
institutional and historical circumstances. 

Given that it is difficult to specify the components of modernization, it is
not too surprising that the theory of modernization and fertility change (de- 
mographic transition theory) has encountered problems. Nevertheless, conclu- 
sions that socioeconomic forces are unimportant because fertility may be 
weakly correlated with "modernization" variables are premature (Cleland & 
Wilson 1987). The basic flaw in demographic transition theory, in my opinion, 
is the assumption that there is a single monolithic pattern of modernization 
that could be indexed by any socioeconomic variable. This assumption made 
it possible to use weak associations (and occasional negative evidence) as the 
basis for claims that all efforts to develop a theory of modernization and fertility 
are futile. The real theoretical challenge is to specify more clearly what aspects 
of modernization are linked to fertility change. 

The weakness of some conventional research within the DTT tradition can 
be illustrated by considering the expected negative relationship between fertility
and general modernization factors such as income or industrialization. It
is true that both higher incomes and greater industrial employment are central 
elements of the broad complex of modernization forces that have transformed 
the world over the last century. This does not mean that higher income or 
industrial employment, by themselves, will motivate families to have fewer 
children. In fact, the reverse is equally plausible. The most direct consequences 
of an increase in income are higher levels of consumption. If children are 
highly valued in a society, economic theory would predict a higher demand 
for children. Economic theory is, however, indeterminate as to the effect on 
fertility because quality (higher-cost children with greater levels of investment 
and consumption) and quantity are substitutes. Higher levels of income may 
also loosen constraints to higher fertility, for example, less absence of the 
husband in search of employment, lower labor force activity by the wife, less 
breastfeeding. In a similar fashion, industrial employment may lead to younger 
age at marriage (and higher fertility) as traditional constraints on family for- 
mation are eased (Goldstone 1986, Haines 1979). 

There are frequent findings of a rise in fertility before the transition to low 
fertility begins (Dyson & Murphy 1985). Easterlin explains this pattern as a 
product of a rise in "supply" factors (less breastfeeding, reduced sterility, early 
marriage, etc) in societies where demand for fertility exceeds actual fertility 
levels (1983:574). What is missing in Easterlin's explanation is a specification 
of what aspects of modernization lead to an increase in supply and what factors 
will lead to a decrease in demand for fertility. Davis (1963) argued that it was
necessary to link the broader macrolevel forces with the microlevel motivations 
for lowered family size (with declines in fertility being one of several mech- 
anisms to this end). In Davis's theory, the critical macrolevel factor was 
declining mortality, which led to household economic strain in the context of 
rising aspirations and new economic opportunities. The logic of Davis's argument
is compelling: the expectation is not that any and all modernization
variables will correlate highly with fertility decline, only those aspects of 
modernization that create direct incentives for smaller families. 

Freedman (1979) hinted at a new approach in his classic assessment of the 
state of theory and research on fertility. After reviewing recent research that 
showed the weak empirical evidence for traditional demographic transition 
theory, Freedman noted that there were a variety of sufficient conditions, 
exemplified by different countries (or regions within countries), that could lead 
to modern fertility transitions. Certain variables, or combinations of variables 
(effective family planning programs, higher status of women, higher levels of 
social welfare, extreme population pressure, a strong state, etc), could lead to 
lowered levels of fertility even in the absence of other variables considered 
essential prerequisites for fertility transitions. Unfortunately, subsequent the- 
oretical writings have not attempted to extend Freedman's analysis beyond the 
discussion of historical examples to the specification of hypotheses that relate 
critical factors (modernization variables) to fertility declines in various settings. 

Perhaps the most effective strategy for the specification of a broader theory 
of modernization and fertility might be to identify the microlevel family
motivations for fewer children and then to work outward to the relevant 
structural forces. The core idea of demographic transition theory is that the 
costs and benefits of children, both short-term and long-term, shape motiva- 
tions for childbearing (Caldwell 1983). All other causal factors may be derived 
from this basic premise. Most obvious are the standard links to infant and child 
mortality (assuming that there is a threshold of family size where benefits 
decrease and/or costs rise), changes in the family economy (less dependence 
on child labor), and rising levels of education (increasing the cost of childrear- 
ing). The cost of children is not limited to direct monetary expenditures but is 
also evident in the time commitments necessary for childrearing. If there is a 
high opportunity cost of the mother's time or an absence of other family 
childcare providers or both, the cost of childrearing is much higher. Extensions 
of this basic logic can be applied to other socioeconomic variables, but a 
credible hypothesis must specify the causal links in terms of the incentives for 
childbearing. Care must be taken to consider the broader social and institutional 
context that may confound expected bivariate relationships. There may also 
be important thresholds or floor effects; for example, one or two children might 
be desired even if the costs of children are very high. 

These issues can be illustrated with recent literature on the slow-down or 
stall in the fertility transition of the Malay population in Malaysia. Rapid 
modernization in Malaysia has brought very low fertility to the Chinese and 
Indian populations there (almost to the replacement level), whereas Malay 
fertility, after experiencing a modest decline during the 1970s, has remained 
at a plateau of over four births per woman during the 1980s. Leete & Tan 
(1993) interpret these differentials as evidence that demographic transition
theory is inadequate and that cultural variables, most notably Islamic funda- 
mentalism, are necessary to explain why Malay fertility has not been reduced. 
It is difficult to evaluate post hoc explanations, but the lack of similar response 
among Malay populations in Singapore and Indonesia creates doubts that a 
simple cultural explanation is sufficient (Jones 1990). Government policies of 
Malay preference (affirmative action programs provide more education and 
employment opportunities for Malays than for Malaysian Chinese and Malay- 
sian Indians) also seem to be correlated with diverging ethnic fertility differ- 
entials in Malaysia, although it is difficult to specify the precise mechanisms 
(Govindasamy & DaVanzo 1992). It is not possible to evaluate these conflict- 
ing claims empirically, but rival "demographic transition" hypotheses are also 
consistent with the reported trends. Malaysian Malay parents have been shel- 
tered from the rising costs of childbearing by government subsidies for edu- 
cation (including college) and by a greater availability of low-cost childcare 
(Hirschman 1986). Although these ideas have not been empirically proven, 
they illustrate the potential importance of some socioeconomic variables (and 
not others) as determinants of fertility levels and change. 


Culture, Diffusion, and Fertility 


A superficial look at the recent literature on fertility may give the impression 
of a battleground between structural and cultural interpretations. This is, how- 
ever, a mock battle played out by a few cases of extravagant claims and 
counter-claims for the benefit of graduate students who are seeking simple 
conceptual schema to organize the literature (see Mason 1992 for a critique 
of some of the rhetorical sleight-of-hand in Cleland & Wilson 1987). Not only 
is the evidence mixed, but most researchers acknowledge the interdependent 
role of social structural conditions that motivate behavior and the spread of 
ideas and information that reinforce behavioral change (Carlsson 1966, Fried- 
lander et al 1991, Gillis et al 1992, Woods 1987). Most structural hypotheses 
are perfectly compatible with conscious decision-making as a potential inter- 
vening process. Theories of normative influences on behavior rarely deny that 
deep cultural values or innovative ideologies are rooted in historical experi- 
ences or material interests. Uncertainties arising from cultural lags, weak 
correlations, and the general lack of comprehensive data have led to differing 
emphases stressing certain variables as more important, but the number of 
demographers who subscribe to completely monolithic explanations is very 
small. 

Indeed, there is often agreement on which variables are most important, but 
differing interpretations of what the variable means. One of the most consistent 
findings in the literature is a negative relationship between women's education 
and fertility, both at the individual and the aggregate level (Cochrane 1979,
1983, Cleland & Rodriguez 1988). There are exceptions, but the relationship 
is one of the most robust in the empirical literature. Although the general 
interpretation is that education is a socioeconomic variable that raises the cost 
of children (directly and indirectly because of the value of the mother's time), 
Cleland & Rodriguez (1988) argue that the influence of education is primarily
ideational because most of the impact of female education is not mediated by 
employment. The same argument could be developed for other significant 
determinants of fertility without any clear resolution. Differences of interpre- 
tation are not entirely empty rhetoric, but convincing arguments should go 
beyond simply claiming a variable for one camp rather than another. 

Two recent studies illustrate the complexity of social context and the role 
of sociocultural processes on human fertility. In one of the most important 
studies from the European Fertility Project, Lesthaeghe & Wilson (1986) found 
that “secularization” was an important predictor of the pace of fertility decline 
in a number of European countries. In an imaginative analysis, Lesthaeghe & 
Wilson showed that both socioeconomic structure (the mode of production, 
indexed by labor force composition) and secularization (vote for socialist or 
nonreligious parties) were important forces that led to more rapid declines in 
marital fertility. The authors reasoned that the political and cultural forces that 
led to a decline in support for traditional religious parties were part of a larger 
shift in the moral code that also legitimated nontraditional reproductive be- 
havior (fertility control). 

The argument that secularization led to both nontraditional political behavior 
and nontraditional fertility behavior is convincing. The question remains, how- 
ever: what social conditions fostered the development of secularization? Sec- 
ularization is not simply a product of shifts in economic structure because 
labor force variables (mode of production) were included as predictors in the 
models. It seems probable that the actions of political activists (local or out- 
side), strikes or other consciousness-changing events, the distribution of news- 
papers or pamphlets, rising levels of education, and other social and political 
activities were responsible for the differential growth of secularism in some 
areas. Are these structural or cultural factors? Changes in the organizational 
structure and political experiences of a community are intimately bound up 
with changes in consciousness of the population. The task is to clarify how 
these factors intersected in different historical contexts to change fertility 
behavior (and other outcomes), not simply to claim that it is an "either-or" 
contest between culture and economic forces. 

Another relevant study is the recent research by Goodkind (1991, 1993) on 
the concentration of births in auspicious years of the Chinese zodiacal calendar 
in a number of East Asian populations. Goodkind shows that the "traditional" 
custom of having a baby born in the year of the dragon did not appear in 
Taiwan before 1976. The explanation that the control of the timing of births 
is not possible in natural fertility populations is disputed by Goodkind, who
shows that the concentration of births in the dragon year of 1976 was accom- 
plished largely by intermediate variables other than contraception (marriage 
timing, abortion, and coital behavior). If it was always possible to fine-tune 
the timing of births so that there was a concentration in auspicious years, why 
did this tradition begin only in the 1970s? It seems that the celebration of 
cultural practices is closely intertwined with modernity and not simply a 
product of historical continuity. In a survey of the role of cultural factors on 
Chinese fertility, Greenhalgh (1988) argues that cultural factors cannot be 
given a transhistorical role but must be interpreted within particular historical 
and institutional contexts. 

The diffusion of patterns of low fertility is often given as a major reason 
for the importance of cultural factors. Clearly, information and beliefs spread 
more easily within communities that share common cultural and linguistic 
characteristics. But rapid changes in behavior over wide geographic areas do 
not necessarily mean that a common cultural system is an essential prerequisite 
for the diffusion of information or for changes in reproduction. In an excellent 
historical study of a rural Sicilian town, Schneider & Schneider (1992) show 
that several decades separated the fertility declines of different classes in the 
community. The fertility decline of the gentry began around the turn of the 
century, the artisan class developed a pattern of lowered fertility in the 1920s 
and 1930s, and the peasantry followed suit in the 1950s and 1960s. Changes 
in fertility seemed to be more closely associated with changes in social and 
economic incentives than the diffusion of values and information in this com- 
munity. In other contexts, all social classes may face common incentives, and 
the pace of change may be determined solely by diffusion of information and 
the cost of fertility regulation. However, diffusion processes have proven much 
more difficult to model and test than to observe (Rosero-Bixby & Casterline 
1993). 

The classic idea of the diffusion hypothesis is that it requires much more 
motivation to be an earlier innovator than to be a later one. Pioneers have to 
bear the full costs of acquiring new information, independent decision-making, 
breaking with tradition, risking social disapproval, and assuming risks of 
uncertainty of future outcomes. After a significant proportion of a population 
has already engaged in innovative behavior (e.g. fertility control), the costs 
for those that follow are much less. This means that the level of motivation 
required for behavioral change by followers is also much less. Given this logic, 
the association of socioeconomic factors (the motivating conditions) and fer- 
tility behavior may loom much larger in the initial spread of a fertility transition 
than in the later stages (Hirschman & Guest 1990).
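
The logic of the preceding paragraph can be illustrated with a small simulation. This is a minimal sketch, not a model drawn from the literature cited above: the function name `simulate_diffusion`, the linear cost schedule, and all parameter values are illustrative assumptions. Each household has a fixed motivation level (standing in for its socioeconomic incentives), and the cost of adopting fertility control is assumed to fall as the share of prior adopters rises.

```python
def simulate_diffusion(motivations, base_cost=1.0, cost_drop=0.9, max_rounds=50):
    """Return, for each round, the motivation levels of that round's new adopters.

    Hypothetical rule: a household adopts once its motivation is at least the
    current cost of innovating, and that cost declines linearly with the
    share of households that have already adopted.
    """
    n = len(motivations)
    adopted = [False] * n
    waves = []
    for _ in range(max_rounds):
        share = sum(adopted) / n
        cost = base_cost * (1.0 - cost_drop * share)
        new = [i for i in range(n) if not adopted[i] and motivations[i] >= cost]
        if not new:
            break  # nobody else is motivated enough at the current cost
        for i in new:
            adopted[i] = True
        waves.append([motivations[i] for i in new])
    return waves


# Eleven hypothetical households with motivation evenly spaced from 0.50 to 1.00.
motivations = [(50 + 5 * i) / 100 for i in range(11)]
waves = simulate_diffusion(motivations)
for round_no, wave in enumerate(waves, start=1):
    print(round_no, "new adopters:", len(wave), "lowest motivation:", min(wave))
```

Under these assumed values, the lowest motivation among new adopters falls from one round to the next: only the most motivated household adopts first, and later adopters need progressively less motivation. In this toy setting, the link between motivating conditions and adoption is therefore sharpest among the pioneers.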

The incorporation of diffusion processes into studies of fertility decline is 
as important as it is difficult (Casterline et al 1987, Montgomery & Casterline 
1993). The development of time series data of detailed geographical areas and
innovative statistical approaches may be essential prerequisites for such work. 


Family Planning Programs and Fertility Declines 


Twenty-five years ago, there was great uncertainty whether family planning 
programs in developing countries would have any impact on fertility declines 
(Davis 1967). Over the last 20 years, a series of cross-national studies have 
estimated the impact of socioeconomic development and family planning effort 
on fertility levels, fertility change, and contraceptive use (Freedman & Berel- 
son 1976, Mauldin & Berelson 1978, Lapham & Mauldin 1984, Mauldin & 
Ross 1991). In spite of some thoughtful skeptics (Hernandez 1981, 1984), the 
general finding is that both socioeconomic conditions and family planning 
programs have significant independent effects on fertility decline, and there is 
a synergistic effect (lowering fertility) of rapid development and an effective 
family planning program. A well-designed longitudinal study of treatment 
(with a family planning program) and nontreatment areas in Bangladesh has 
found strong evidence that effective family planning programs lower fertility 
(Phillips et al 1988). A comparable study of Egypt, however, found mixed 
results of the effects of family planning on fertility attitudes and behavior 
(Stycos et al 1988). 

In spite of the general consensus on the value of family planning programs, 
there is still much debate in the field over the relative efficacy of investments 
in family planning programs (compared to investments in development) and 
over what aspects of programs make a difference. The main analytical problem 
is that family planning programs are not random events but are dispropor- 
tionately located in countries where social and economic conditions are already 
favorable for fertility declines. Skeptics claim that historical fertility declines 
in many Western countries occurred without organized family planning pro- 
grams and that the distribution of contraceptives through the private market 
might well provide most of the services that public family planning programs currently
do. These questions are still debated and probably cannot be resolved with the 
standard methods of program evaluation. 
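
The placement problem described above can be made concrete with a simulated example. The sketch below is hypothetical throughout: the data are synthetic, and the function names, the assumed true program effect of -0.5 births, and the placement rule are illustrative assumptions rather than estimates from any study cited here. Programs are placed more often where development conditions are favorable, so a naive comparison of program and nonprogram countries mixes the program effect with the development effect.

```python
import random


def simulate_countries(n=2000, program_effect=-0.5, seed=0):
    """Generate synthetic (development, has_program, fertility) records.

    All quantities are invented for illustration: `development` is a 0-1
    index, and a program is more likely where development is higher.
    """
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        development = rng.random()
        has_program = rng.random() < development  # nonrandom placement
        fertility = 7.0 - 4.0 * development + rng.gauss(0.0, 0.3)
        if has_program:
            fertility += program_effect
        rows.append((development, has_program, fertility))
    return rows


def mean(values):
    return sum(values) / len(values)


def naive_gap(rows):
    """Mean fertility of program countries minus that of nonprogram countries."""
    treated = [f for _, p, f in rows if p]
    control = [f for _, p, f in rows if not p]
    return mean(treated) - mean(control)


def stratified_gap(rows, bins=10):
    """Average the program/nonprogram gap within narrow development bands."""
    gaps = []
    for b in range(bins):
        lo, hi = b / bins, (b + 1) / bins
        band = [r for r in rows if lo <= r[0] < hi]
        treated = [f for _, p, f in band if p]
        control = [f for _, p, f in band if not p]
        if treated and control:  # skip bands lacking either group
            gaps.append(mean(treated) - mean(control))
    return mean(gaps)


rows = simulate_countries()
print("naive gap:", round(naive_gap(rows), 2))
print("gap within development bands:", round(stratified_gap(rows), 2))
```

In this synthetic setting the naive gap is much larger in magnitude than the assumed program effect, while the comparison within development bands comes close to it. The adjustment works here only because the confounding condition is observed by construction; the difficulty raised in the text is that the favorable conditions in real countries are not so conveniently measured.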

One of the important unresolved questions has been the identification of the 
components of family planning programs and the relative impact of different 
program activities on contraceptive adoption and fertility decline. The primary 
source of data about the attributes of national family planning programs has 
been ratings of program effectiveness collected in questionnaires from knowl- 
edgeable experts (program administrators, funding agency officials, and others 
familiar with family planning programs in various countries) (Mauldin & Ross 
1991). In most studies, the responses from these questionnaires have been 
summarized into one global dimension of "program effort," although there is 
clearly a set of underlying dimensions that characterize the major features of
family planning programs (Entwisle 1989). 

Perhaps the most important finding for policy makers is that the availability 
of family planning services, measured by the proximity to clinics, does have 
a measurable effect on contraceptive use (Entwisle et al 1986, Tsui & Ochoa
1992). Research on the identification of the impact of specific aspects of family 
planning programs on fertility behavior and the relative cost-effectiveness of 
different strategies does not seem to have yielded clear answers (Phillips & 
Ross 1992). 


OTHER MODELS AND OTHER QUESTIONS 


The title of this paper presumes that there is an answer to the question of why 
fertility changes. If the empirical research reviewed in this essay is an accurate 
picture of the state of knowledge on the topic, there are many possible answers 
to this question, usually phrased in terms of particular historical conditions. 
The model of social causation, implied by the question, does not accord well 
with such varied responses. The standard social science model is that society 
works pretty much like a regression equation: the task is to find the right set 
of predictors, solve the equation, and discover what factors are most important 
in predicting social outcomes. This framework does lead to empirical gener- 
alizations, but there seem to be endless qualifications about the measurement 
of variables, the meaning and interpretation of variables, the substitutability 
of one variable for another, and complex interactions with historical settings. 
If science is to discover parsimonious principles that explain complex patterns, 
we do not seem to be making progress. 

Perhaps we need to reconsider the question about fertility change in light 
of other models of population and society. The classical model of demography 
is the Malthusian equilibrium, where there are fluctuations of growth and 
decline around the balance of population size and economic resources. Every 
student of demography quickly learns all the shortcomings of Malthus’s prin- 
ciple of population—Malthus did not anticipate technological change nor the 
ability of modern societies to control marital fertility. What is rarely stressed 
is that the equilibrium model did fit rather well with the dynamics of pre- 
industrial societies (Wrigley 1969, Grigg 1980, Galloway 1988). 

The logic of a homeostatic model is a system maintained by countervailing 
pressures. In the Malthusian system, constant pressure for population growth 
is fueled by the passion between the sexes, but this is countered by the negative 
feedback loop of limited food supplies on population size. This negative 
feedback (density dependence) leads to the positive check 
of increased mortality. Preventive checks (via constraints on marriage) that 
slowed population growth were the other mechanism to maintain equilibrium 
in the Malthusian model. 
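
The countervailing pressures described above can be illustrated with a toy simulation. The sketch below is not drawn from the studies cited here. It assumes a simple logistic-style rule in which growth pressure is constant per person and the feedback from limited resources strengthens as population approaches a fixed limit. The names `simulate_population`, `intrinsic_growth`, and `carrying_capacity`, and all numeric values, are hypothetical choices made for illustration.

```python
def simulate_population(initial, intrinsic_growth, carrying_capacity, periods):
    """Toy density-dependent growth (an assumed logistic rule, not a fitted model)."""
    population = initial
    trajectory = [population]
    for _ in range(periods):
        # Constant pressure for growth, proportional to current population.
        pressure = intrinsic_growth * population
        # Negative feedback: the closer population is to the resource limit,
        # the smaller the net growth; above the limit, net growth is negative
        # (the analogue of the positive check of increased mortality).
        feedback = 1.0 - population / carrying_capacity
        population = population + pressure * feedback
        trajectory.append(population)
    return trajectory


# Two hypothetical starting points, one below and one above the resource limit.
below = simulate_population(500.0, 0.1, 1000.0, 200)
above = simulate_population(1500.0, 0.1, 1000.0, 200)
print(round(below[-1]), round(above[-1]))
```

Under these assumptions both trajectories settle near the same level, which is the sense in which a homeostatic system is maintained by opposing pressures rather than by any single determinant.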

Ronald Lee (1987) has suggested that Malthusian homeostatic principles, 
along with Boserupian forces that created positive feedbacks, were central to 
shaping cycles of population growth and decline until about 100 years ago 
when rapid technological progress broke the negative feedback loop. There 
may be, however, other equilibrating forces that have an impact on population 
growth in the modern era, especially in countries with rapidly growing popu- 
lations. I am not suggesting that there is an instinct for families to have two 
surviving children, only that rapid population growth certainly creates systemic 
pressures (negative feedbacks) for lowered fertility. A homeostatic model, 
incorporating Kingsley Davis's notion of household strain as a central moti- 
vating factor, might well provide an alternative framework for studying fertility 
dynamics as part of an interdependent system that creates opportunities and 
costs for family welfare. 

An equilibrium or homeostatic framework might be usefully applied to some 
of the anomalous patterns considered in the prior review of the literature (Lee 
1987). The first case is the wide variations of fertility in "natural fertility" 
populations. Recall that fertility responded in a systematic way to variations 
in economic conditions in premodern Europe via changes in marriage patterns 
(Wrigley 1966, Goldstone 1986). There are also systematic differences be- 
tween forager and agricultural populations (Gillian et al 1993) and between 
agricultural populations at different levels of density (Easterlin 1971, 1976a, 
Firebaugh 1982). The wide variations in intermediate variables in many tra- 
ditional (pre-fertility transition) populations (sexual abstinence, breastfeeding 
duration, rules on widow remarriage, patterns of male migration, divorce, etc) 
seem to be simply cultural curiosities unless we assume that these practices 
were social adaptations to regulate population size and growth, just as rules 
on marriage in premodern England were responses to economic cycles. Much 
of contemporary demographic wisdom considers these variations beyond the 
relevance of theories of fertility decline, which are limited to the origins of 
conscious control of marital fertility. Conscious patterns of fertility control are 
an important part of the process, but the theoretical framework of fertility 
transitions should be built on a broader base. 

If the homeostatic principle is to maintain demographic equilibrium in order 
to avoid community and household strain, then changes in fertility are only 
one of several mechanisms that can respond to the rapid increases in population 
growth (as a result of reductions in mortality) that began in the eighteenth and 
nineteenth centuries in many European countries and around the globe in the 
twentieth century. The first response was probably migration: to agricultural 
frontiers, to cities, and to settler societies in the New World and Oceania. The 
two extreme cases in European demographic history—why did the fertility 
decline begin so early in France and so late in England?—may be explicable 
in terms of differential opportunities for outward migration from rural areas. 
But migration was only a temporary palliative; the explosive population growth 
that resulted from declining mortality rates led to fertility reductions that 
eventually (within a century) resulted in a near demographic equilibrium in 
industrial countries. If current patterns continue, the same outcome is likely 
in the contemporary developing countries in the next 50 years or so. 

The wide variations in correlates of the speed of fertility reduction seem 
anomalous only if the "true theory" is fixed on a small number of master 
determinants of lowered fertility in all settings. If, on the other hand, the 
homeostatic principle is the central idea, the reduction of population growth 
by fertility decline could occur differentially across a wide variety of situa- 
tions—depending on the value and costs of children, the costs of fertility 
regulation, access to information, and many other conditions. While it is 
important to map the variations in timing of fertility transitions (including the 
initiation and the pace of change) and the immediate correlates of differential 
timing, the central point is that all of these revolutionary demographic changes 
spanned the globe in about a century. 

I am not entirely convinced that the alternative model of demographic 
homeostasis will lead to directions of day-to-day research fundamentally dif- 
ferent from theories of fertility decline. But it casts the central question rather 
differently. Rather than asking why fertility declines and expecting a simple 
answer, the alternative may be to ask, how are demographic equilibria rees- 
tablished after mortality declines? This question will require a much greater 
tolerance of complexity as there are certain to be many paths to this end. There 
may also be greater variations in demographic regimes at the conclusion of 
fertility transitions, but that is a story for another day. 


Abstract 


This paper examines changes in marriage as an institution for rearing children 
in the United States. It reviews the effects of marital instability and living 
arrangements on children's welfare, and focuses on how children's economic, 
emotional, and social needs are met when parents separate. The review shows 
that changes in marriage and childrearing have different consequences for 
women and men. For women, marriage and parenthood are distinct institutions. 
Women provide for children's needs, whether or not the women are married 
to their children's fathers. For men, marriage defines responsibilities to chil- 
dren. At divorce, men typically disengage from their biological children. When 
men remarry they may acquire new children whom they help to support. The 
review describes the effects on children of divorced mothers' and fathers' 
varying commitments to childrearing. It considers the difficulties that divorced 
parents experience when they try to continue to share responsibilities for 
children after separation, and it suggests avenues for future research. 


INTRODUCTION: MARRIAGE AND CHILDREARING AS 
CHANGING INSTITUTIONS 


Increases in divorce and nonmarital childbearing in the United States reduce 
the centrality of marriage as an institution in which childbearing and childrear- 
ing occur. Over this century, parents have become more likely to divorce 
(NCHS 1989), even when they think divorce may not be in children's best 
interests (Thornton 1985). By 1990, the percentage of children born to 
unmarried parents had risen to 28% (NCHS 1993). These trends have changed 
children's living arrangements (Hernandez 1993). Married parents nearly 
always live together with their minor children. When their parents' marriage 
ends, children live with only one parent. Although children may move back 
and forth between their mother's and father's homes, or acquire stepparents 
through parents' remarriage, children seldom again live with both of their 
original parents.¹ 

The coincidence of marriage and co-residence with parents means that any 
treatment of the effects on children of marital dissolution must also consider 
the effects of living in a single-parent household. Until the middle of this 
century, most children who experienced marital disruption did so because 
one of their parents had died. Improvements in life expectancy have dramat- 
ically reduced the chances that a child born today will lose a parent through 
death. Instead, children who experience the dissolution of their parents' 
marriage usually do so as a result of divorce and marital separation (Bane 
1976). Nearly a quarter of the children in recent birth cohorts will spend 
time in a single-parent household because of divorce. Another quarter will 
live with a single mother due to nonmarital childbearing. Among those born 
outside of marriage, one in four children actually begins life in a household 
with both biological parents, a pattern only recently evident from new 
information on unmarried cohabitation (Bumpass & Raley 1993). Thus, even 
children born outside of marriage may experience the break up of their 
parents’ relationship. These demographic trends suggest that the experiences 
of children born in and outside of marriage may be more similar now than 
in the past. Both married and unmarried parents may have short-term unions, 
and secular growth in the percentage of children who live with unmarried 
but cohabiting parents suggests that in the next decades even more children 
will experience the instability of parents' relationships (Sweet & Bumpass 
1987). 

When parents and children live together, parents provide for children's basic 
needs—their economic support, emotional security and attachment, and super- 
vision and training. Of course some parents who live with children do not or 
cannot provide adequately for their children, but in the contemporary United 
States, parents are primarily responsible for performing these functions. When 
parents live apart, as happens at divorce, children still have these needs. How 
divorced parents fulfill these needs is the central question of this review. 
Children who live apart from a biological parent provide an opportunity for 
sociologists interested in kinship to examine the behavioral components of 
parenthood. Studies of the behaviors that define parenthood assume that the 
rights and responsibilities of parents are socially defined, and that biological 
ties to children are less important for understanding children’s life chances 
than are adults’ social ties to children. Despite obvious exceptions to this 
generalization, such as when genetic inheritance affects children’s physical 
health and cognitive abilities, the adults who care for children have a major 
impact on children’s well-being. Emphasis on parenthood as socially defined 
is consistent with the finding that children living with stepfathers are much 
more likely to say that their stepfather is a member of their family than they 
are to include their nonresident biological father as a family member 
(Furstenberg & Nord 1985). 


¹I use the term “original” to describe both children’s biological parents and those parents who 
have adopted children at a very young age. 

Parents necessarily share material resources with young children who live 
with them. When parents and children live apart, researchers can investigate 
the relationship between economic investments in children and socioemotional 
ties between parents and children. The degree to which economic and social 
bonds overlap is a central concern in studies of extended families and of 
parents’ relationships with adult children (e.g. Adams 1970, Lee 1979, Hogan 
et al 1993). As more biological parents live apart from their young children, 
this question becomes relevant to studies of parents and minor children as well 
(Seltzer et al 1989). In addition, because the state regulates divorce and some 
aspects of nonmarital childrearing, changes in laws and policies related to 
divorce are natural experiments on the causes of alternate kinship structures. 
These experiments are not only ivory-tower attempts to understand family 
institutions. Ideally, knowledge of how children fare in different family ar- 
rangements informs social policies to enhance the well-being of future gener- 
ations (Garfinkel & McLanahan 1990). 

Studies of the effect of marital dissolution on children abound. My review 
complements several recent reviews which consider the effects of marital 
dissolution and living with a single parent during childhood on individual 
well-being (e.g. Amato 1993, Amato & Keith 1991a,b, Chase-Lansdale & 
Hetherington 1990, Emery 1989, Furstenberg 1990, McLanahan & Booth 
1989, McLanahan & Sandefur 1994, Mott 1993). By focusing on parent-child 
relationships after divorce, this review describes how changes in family insti- 
tutions may exacerbate or ameliorate negative effects of marital dissolution on 
children's well-being. 

The separation of marriage and childrearing has different consequences for 
women and men. For women, marriage and caring for children are distinct 
institutions. Once women bear children, the children live with them and remain 
their responsibility, regardless of marriage, remarriage, or cohabiting relation- 
ships. For men, marriage and co-residence usually define responsibilities to 
children. Regardless of their biological ties to children, men share time and 
resources with the children of their wives or female partners. When marriages 
dissolve, men disengage from these children as they disengage from their 
former wives, and then men form attachments to new children through new 
wives or female partners. [See Furstenberg & Cherlin (1991) for a similar point 
about marriage and childrearing as overlapping institutions for men.]² 

This review is organized as follows. First I briefly summarize past research 
on the effects of marital dissolution on children. This is followed by a discus- 
sion of how family relationships are reorganized when parents separate. I then 
describe the mechanisms by which parents who live apart provide for chil- 
dren's economic, emotional, and socialization needs, and consider how well 
these mechanisms work. I emphasize economic needs over emotional security 
and socialization because children's material hardship and economic welfare 
are the focus of a number of recent federal and state policy changes. Low 
income and financial strain also affect the emotional environment of families 
and parents’ ability to supervise children. The last section considers the im- 
plications of change in marriage and childrearing institutions for children and 
their mothers and fathers. 


EFFECTS OF MARITAL SEPARATION ON CHILDREN 


Children whose parents separate typically suffer disadvantages compared to 
children whose parents live together (Amato & Keith 1991b, McLanahan & 
Sandefur 1994). Recent reviews summarize evidence showing that children are 
emotionally distressed by parents' separation (Chase-Lansdale & Hetherington 
1990, Emery 1989, Furstenberg 1990). Young children, especially, are de- 
pressed and anxious, and they may feel torn by loyalties to both parents 
(Wallerstein & Kelly 1980b). Children's school behavior and achievement also 
suffer, and these disadvantages may have long-term effects, reducing rates of 
high school graduation and years of schooling completed (McLanahan & 
Sandefur 1994). Children who grow up in single-parent households also are more 
likely to be delinquent than their peers whose parents stay together (Matsueda & 
Heimer 1987, but see Zill et al 1993 for a null finding). Finally, some evidence 
suggests that girls from disrupted families are more likely to become single 
mothers themselves, compared to girls from families that have not been disrupted 
(Wu & Martinson 1993, McLanahan & Bumpass 1988). Children who are 
younger when their parents separate may be more seriously disadvantaged than 
those who are older at the time of disruption (Allison & Furstenberg 1989), but 
evidence on this is inconsistent (Furstenberg 1990). Compared to girls, boys may 
also be harmed more by parents' marital disruption (Hetherington et al 1985), 
although again the evidence is inconclusive (Zaslow 1988, 1989). 


²Ironically, this change in the institution of marriage creates greater gender differences in family 
relationships at the same time that women and men have greater equality in the paid labor force than 
at any other time in the past century (O’Neill & Polachek 1993). These trends are related. Women’s 
gains in the paid labor force may be a source of the rise in marital instability (Furstenberg 1990, 
Goode 1993). 

This characterization of the research on the effects of divorce ignores that 
marital disruption affects some children more than others. Some children show 
no ill effects. As in medical research on risk factors in illness, studies about 
the effects of divorce are useful for predicting differences between categories 
of people but cannot address directly whether a specific individual will be 
harmed by divorce. 

Despite general agreement that in the aggregate there are short-term, and some 
longer-term, negative effects of marital separation on children, researchers often 
disagree about the magnitude and source of these differences. [See the review by 
McLanahan & Booth (1989) for a summary of recent debate about the effects of 
living with a single mother and the exchange between Amato (1993) and his 
critics for an example of conflict about the effects of divorce on children.] 

Debates about the extent of the disadvantage associated with divorce depend 
on whether the difference between children with married parents and those with 
separated parents is measured in absolute levels or in terms of relative risks. 
Consider the example of high school drop-out rates. About 29% of children from 
single-parent households drop out, compared to 13% of children from house- 
holds with both original parents, according to estimates from the National 
Longitudinal Survey of Youth (NLSY), which are adjusted for children's 
demographic characteristics (e.g. race, sex, region of country, siblings) and 
parents' education (McLanahan & Sandefur 1994). The absolute difference in 
drop-out rates is 16%, but the rate for children in single-parent households is 
twice as high as for children living with both parents. Absolute differences 
between the two groups of children are generally small across various outcome 
measures; however, the increased risk of negative outcomes for children from 
disrupted households may be very high relative to children from intact families 
(Zill et al 1993). 
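
The contrast between the two measures can be made concrete with the drop-out figures quoted above. The short sketch below is only an arithmetic illustration; the function name `compare_rates` is hypothetical, and the inputs are the adjusted NLSY estimates of 29% and 13% reported in the text.

```python
def compare_rates(rate_exposed, rate_reference):
    """Return (absolute difference, relative risk) for two proportions."""
    absolute_difference = rate_exposed - rate_reference
    relative_risk = rate_exposed / rate_reference
    return absolute_difference, relative_risk


# Drop-out rates from the text: single-parent vs. two-original-parent households.
diff, ratio = compare_rates(0.29, 0.13)
print(f"absolute difference: {diff * 100:.0f} percentage points")
print(f"relative risk: {ratio:.2f}")
```

The same pair of rates thus supports both a description of the gap as modest (16 points) and a description of the risk as roughly doubled, which is one reason the two framings lead to different impressions of the size of the effect.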

Controversy about the effects of marital dissolution also reflects ambiguity— 
and sometimes explicit disagreement—about whether to consider the “cause” as 
divorce itself or the movement to a single-parent household and its associated 
decline in income (see below). This ambiguity is compounded in studies that do 
not distinguish single-parent households formed by divorce from those formed 
by widowhood or when children are born outside of marriage. Other sources of 
controversy include inattention to whether samples are restricted to children of 
recent divorce or whether they include children in both divorced and remarried 
families, and ambiguity about whether to include the conflict leading up to 
marital dissolution as a divorce effect. In addition, researchers often ignore the 
questions of whether and how to distinguish long-term family problems which 
may be a cause of divorce (e.g. strain in the parents’ relationship with each other 
or chronic economic hardship) from the effects of divorce itself. That many 
studies do not address these conceptual ambiguities and that authors sometimes 
describe their findings without attending to whether they address problems of 
divorce or problems of living in a single-mother household exacerbate 
disagreement about the size and sources of effects on children of marital dissolution. 

Major explanations for an effect of marital dissolution on children empha- 
size the disruption or loss of elements which provide for children's basic needs: 
economic factors, emotional security and enduring bonds, and socialization 
and supervision. Despite disagreement about the extent to which each of these 
elements accounts for negative effects of marital dissolution on a child's 
well-being, these elements are crucial for the welfare of children no matter 
where they live or who their parents are (Maccoby 1980). 


REORGANIZING THE FAMILY 


At divorce, parents must reorganize their responsibilities for children. They 
decide where the children will live and who will be responsible for their daily 
care. Parents also decide who will be responsible for major decisions about 
the children. As time passes, children may acquire new family members 
through parents' remarriage. This section considers these three dimensions of 
family reorganization. 

Where Children Live 


Parents reallocate their responsibilities for children when their marriage breaks 
up. First they must decide where the children will live. Where and with whom 
children live indicate the physical setting and material resources to which 
children have direct access. Co-residence provides opportunities for the 
intimate interactions and for the adult supervision required for successful 
socialization of children. Census data show that 86% of families with a single parent 
were maintained by mothers in 1992. This is a slight decline from the previous 
20 years when mothers headed 90% of these households. Compared to whites, 
black single-parent households are more likely to be maintained by mothers, 
largely because blacks have higher rates of nonmarital childbearing (Rawlings 
1993). Nonmarital childbearing also accounts, in part, for the larger percentage 
of black children who will spend at least some time in a single-parent house- 
hold (Furstenberg et al 1983, Bumpass & Sweet 1989). For example, among 
children born in the 1970s, about 80% of black children were likely to live in 
a single-parent household, compared to 36% of non-Hispanic white children 
(Bumpass & Sweet 1989). 

Information from the census does not provide direct evidence about chil- 
dren's living arrangements when their parents divorce because the data are 
classified by parents’ current marital status rather than by reasons for living 
in a single-parent household. Children whose parents divorced and remarried 
are not identified routinely as living in stepfamilies (but see Moorman & 
Hernandez 1989). Similarly, parents who had a child out of marriage, and then 
married and divorced are treated as divorced, single parents. Large surveys of 
recent divorces in California and Wisconsin show that most parents still assign 
primary physical custody to the mother (Maccoby et al 1988, Seltzer 1990). 
The Stanford Child Custody study, which followed divorced families in north- 
ern California for the first three-and-a-half years after separation, shows that 
two out of three families arranged for children to live with their mothers, and 
in another one out of six, children spent considerable time (4-10 days in a 
two-week period) in each parent's household. The remainder lived mostly with 
their father or in some other arrangement (Maccoby & Mnookin 1992). Be- 
cause the Stanford study uses a geographically restricted sample from a state 
known for its reforms to divorce law, these patterns may not reflect the typical 
experience of children whose parents divorce. National studies show that few 
US children split their time evenly between their mother's and father's house- 
holds (Furstenberg et al 1983, Seltzer 1990, 1991b). 

At divorce, boys are slightly more likely than girls to live with their father 
or to have dual residence, but the sex difference in residence patterns is most 
pronounced among children aged 11 and older (Maccoby et al 1988). Debate about 
whether children benefit from living with their same-sex parent depends more 
on theoretical expectations about childrearing processes than empirical evi- 
dence. Theories about sex-role socialization and modeling suggest that boys 
may suffer more from a father's absence from the household than do girls. 
Boys living with divorced mothers do have more adjustment problems com- 
pared to girls, at least until their mother remarries, when girls’ problems 
increase (Hetherington et al 1985). Among children who live with their fathers, 
girls are less competent socially and more depressed than boys are (Warshak 
& Santrock 1983, Peterson & Zill 1986, Camara & Resnick 1988), and girls 
living with their fathers have lower grades and more adjustment problems than 
girls living with their mothers (Maccoby et al 1993). However, Maccoby and 
her colleagues (1993) do not find that boys are advantaged by living with their 
fathers. The very small numbers of father-custody families and the still selec- 
tive character of these families compared to mother-custody families suggest 
that these results be interpreted with great caution. Even more importantly, 
evidence that boys' difficult behavior predates divorce (Block et al 1988, 
Cherlin et al 1991), that married parents treat sons and daughters differently 
(Marsiglio 1991, Harris & Morgan 1991), and that boys may be more vulner- 
able than girls are to stress (Rutter 1983) casts doubt on whether living with 
a same-sex parent ameliorates negative effects on children of divorce. [See 
Zaslow (1988, 1989) and Stevenson & Black (1988) for reviews of sex differ- 
ences in children's adjustment to divorce. Hetherington and her colleagues 
(1983) also provide a valuable assessment of research on cognitive functioning 
and school achievement for boys and girls.] 

Parents may alter children's living arrangements during the early postdivorce 
period. Children who live with their mother after divorce are more 
likely to retain this arrangement during the first three to four years after 
separation. In contrast, over half of children who start out by spending time 
in each parent's household (i.e. dual residence) or who start by living mostly 
with their father make at least one change (Maccoby & Mnookin 1992). 
Residential instability is associated with adjustment problems among adoles- 
cents, but the direction of causation is ambiguous (Maccoby et al 1993). 
Whether or not parents separate, children who move frequently during child- 
hood are less likely to complete high school (Haveman et al 1991). 


Legal Parenthood 


In addition to deciding where children will live, parents negotiate legal custody 
which establishes the formal right to make decisions about children's lives. 
Although legal and physical custody (residence) usually coincide, an important 
exception is when parents share authority over children through joint legal 
custody, but the children spend most of their time in one parent's household. 
The most common instance is when parents share joint legal custody but 
children spend most of their time with the mother (Seltzer 1990, Maccoby et 
al 1988). How parents divide decision-making rights may affect the pattern of 
postdivorce parenting and children's well-being. The rationale for joint custody 
is to make children's lives after divorce as similar to the lives of their non- 
divorced counterparts as possible (Wallerstein & Blakeslee 1990, Felner & 
Terre 1987). Compared to fathers in families in which mothers have sole legal 
custody, nonresident fathers with joint legal custody of the children spend 
more time with the children (Wolchik et al 1985, Kline et al 1989) and may 
be more likely to comply with child support awards (Pearson & Thoennes 
1988, but see Seltzer 1991a and Albiston et al 1990 for contrary evidence). 

Parents of higher socioeconomic status are more likely to acquire joint legal 
custody (Koel et al 1988, Seltzer 1990), and their greater resources may explain 
the higher levels of postdivorce involvement of fathers with joint legal custody 
compared to those without custody (Seltzer 1991a). Other family 
characteristics that are more difficult to observe, such as the predivorce relationship 
between spouses or parents' commitment to children, may also explain a 
positive association between joint legal custody and postdivorce paternal in- 
volvement, but the evidence on this is mixed. Although some studies suggest 
that fathers who are heavily invested in childrearing before divorce continue 
their involvement after separation (e.g. Lowery 1986), evidence from other 
research suggests a lack of continuity between pre- and postdivorce involve- 
ment of fathers (Hetherington 1993, Wallerstein & Kelly 1980a, Maccoby & 
Mnookin 1992). Research on legal custody and continuity in divorced parent- 
child relationships has been hampered by small, select samples and a reliance 
on retrospective reports of predivorce relationships. 

We know little about whether legal custody arrangements after divorce vary 
by race, beyond the association between race and socioeconomic status. Most 
research on legal custody uses samples that are all or predominantly white 
families. A reason for this is that blacks are less likely to formally divorce 
than are whites (Sweet & Bumpass 1987); therefore state-based studies which 
sample divorce court records include few black families. However, high rates 
of informal marital separation among black parents and the disjuncture of the 
institutions of marriage and childrearing suggest that studies of dual residence 
should focus more on race and ethnic differences in children’s living arrange- 
ments and their consequences (del Carmen & Virgo 1993). 


Acquiring New Parents 

Parents’ remarriage alters the family organization of children again. Many 
children spend only a few years in a single-parent household before they 
acquire a stepparent. Taking account of nonmarital cohabitation, half of chil- 
dren spend fewer than four years living with a single mother before acquiring 
a stepfather. Black children are less likely than whites to experience their 
mothers’ remarriage or informal cohabiting union, but for both black and white 
children, living in a household with a lone parent is often a short-term arrange- 
ment (Bumpass & Raley 1993). Although four years is a short period in an 
absolute sense, it is a significant proportion of a child’s lifetime. Just as the 
circumstances of divorce vary considerably, both the pace of remarriage and 
children’s reactions to it show substantial variation. Acquiring a 
stepparent generally reduces the economic problems of divorce, but differences 
in well-being persist between children who live with two original parents and 
those whose parents have been divorced (Hetherington & Clingempeel 1992, 
Dawson 1991). 

Grandparents also take on new family roles at divorce (Cherlin & Fursten- 
berg 1986). About 16% of children will live with their single mother and 
grandparents, according to estimates for children born in the 1980s (Bumpass 
& Raley 1993). Grandparents may also provide daycare for children, resulting 
in children spending considerable time in the grandparents’ home, even though 
the children live with their parent(s) (Jendrek 1993). Black children of single
mothers are more likely than white children to live with their grandparents, 
but this is largely due to the higher proportion of black children whose mothers 
have never been married (Hernandez 1993: Table 3-10). The sections below 
consider how stepparents and grandparents fulfill children’s needs. 


MARITAL DISRUPTION AND CHILDREN’S ECONOMIC 
NEEDS 


Money cannot buy happiness, but it is difficult for children to be happy if they 
do not have enough to eat and a safe place to live. This section considers the 
economic disadvantages of children whose parents separate, and it reviews 
evidence about parents’ economic contributions to children who live apart
from them. 

Compared to children who grow up in two-parent households, those who 
live with single mothers are more likely to be poor (Duncan & Rodgers 1991). 
Even if they are not poor, children's family income declines by 37% when
their father moves out of the household. Inasmuch as this reduced income is
shared by fewer people, children whose parents separate experience about a 
21% reduction in per capita income. This income loss essentially remains stable 
through the first year after separation (Bianchi & McArthur 1991). Losing the 
father's income is not the only source of economic disadvantage of children 
of divorce. Parents who separate and divorce have lower incomes and fewer 
assets prior to divorce than do parents who remain married (Bianchi & Mc- 
Arthur 1991, Duncan & Hoffman 1985, McLanahan & Sandefur 1994). Once 
parents separate, they must bear the expense of two households instead of one, 
and this limits the financial resources available to both households, regardless 
of where the children live. 
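
 The gap between the 37% drop in family income and the roughly 21% drop in per capita income follows from simple arithmetic once household size is taken into account. The sketch below is illustrative only: the household sizes (five people before separation, four after) are hypothetical values chosen for the example, not figures reported by Bianchi & McArthur (1991).

```python
def per_capita_change(income_decline, size_before, size_after):
    """Return the proportional change in per capita household income.

    income_decline: proportional drop in total household income (0.37 = 37%).
    size_before, size_after: number of household members before and after
    separation (hypothetical inputs, not taken from the cited studies).
    """
    remaining_share = 1.0 - income_decline
    # Ratio of per capita income after separation to per capita income before.
    ratio = remaining_share * size_before / size_after
    return ratio - 1.0


if __name__ == "__main__":
    change = per_capita_change(0.37, size_before=5, size_after=4)
    print(f"Per capita income change: {change:.1%}")
```

 Under these assumed household sizes the per capita decline is close to 21%, which shows how a large drop in total income can translate into a smaller, though still substantial, drop in income per person.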

Economic differences explain a large portion of children's educational dis- 
advantage when they live in single-parent households (Guidubaldi 1988, Krein 
& Beller 1988, McLanahan 1985, McLanahan & Sandefur 1994). Low income 
also explains part, but not all, of the differences in other outcomes, such as a 
young woman's chances of becoming pregnant and being a single mother 
(Hogan & Kitagawa 1985, McLanahan 1988). 

Evidence about the importance of economic resources for explaining effects 
of marital dissolution on child well-being is more limited than might be 
expected given the general acknowledgement that financial hardship accom- 
panies divorce. Few studies have longitudinal data on income before and after 
divorce. Some studies treat parents' education or occupational status as mea- 
sures of predivorce economic status. Education and occupation are associated 
with income, but may also affect child well-being through childrearing atti- 
tudes and practices (e.g. Kohn 1978, Alwin 1984). Conceptual issues also 
require further attention. Arguments about economic hardship have four com-
ponents: (i) low absolute level of economic resources; (ii) abrupt decline in
resources after separation; (iii) disparity between the incomes of mothers and 
fathers; and (iv) instability in income after divorce. The studies above address 
the problem of low absolute levels of income. Clinical research, such as the 
study conducted by Wallerstein, Kelly, and their colleagues, in which divorced 
children and their parents have been followed for over 15 years (Wallerstein 
& Kelly 1980b, Wallerstein & Blakeslee 1990), shows that children miss the 
material comforts of their predivorce homes, and that they resent their nonres- 
ident fathers having better living conditions than they and their mothers have 
(Wallerstein & Huntington 1983). 

 Direct evidence about the effects of income instability is even more sparse,
although the associations between economic deprivation and negative life
events, such as residential moves, and between economic hardship and tense 
emotional environments are well documented (McLoyd 1990, Elder et al 
1992). The lack of valuable assets held by divorced parents suggests that most 
divorced families do not have "nest eggs" on which to rely in emergencies 
such as illness or unemployment. Single mothers have trouble obtaining credit 
after divorce (Arendell 1986, Kitson 1992). In addition, because single parents 
are more likely to live in poor neighborhoods (McLanahan et al 1991) and to 
have family and friends who also have low incomes, it is probably more 
difficult for single mothers to borrow money from members of their social 
networks than it is for married families. Nonetheless, to my knowledge, there 
are no direct tests of income instability as a source of disadvantage for children 
whose parents divorce. Studies do consider mothers' perceptions that they have 
economic problems and whether the problems have gotten worse. These studies 
show that financial problems reduce mothers' psychological welfare (e.g. 
Menaghan & Lieberman 1986), and mothers' emotional adjustment has strong 
effects on childrearing behavior and may reduce the quality of parent-child 
interactions (McLoyd 1990, Hetherington et al 1982). Uncertainty about in- 
come, such as when mothers are able to obtain only sporadic employment or 
when contributions from the nonresident father are irregular, may threaten the 
emotional security of children's environment (see below). 


Economic Needs and Fathers’ Contributions 


Children who live with single mothers rely primarily on their mothers' earnings for their
material needs. However, single mothers earn considerably less than married 
fathers do because mothers bear greater childcare responsibilities, have less 
paid work experience and training (related to their higher investment in child- 
care), and suffer persistent labor market discrimination. Mothers' lower earn- 
ings are not compensated for either by private transfers from fathers in the 
form of child support, or by public subsidies, such as Aid to Families with 
Dependent Children (AFDC) (Garfinkel & McLanahan 1986). This section 
describes nonresident fathers' contributions to children's economic needs, and 
considers methodological problems of measuring these contributions. 


CHILD SUPPORT PAYMENTS At divorce parents reallocate financial responsi- 
bilities for children. These responsibilities are codified in child support awards, 
which are ratified by the court. In 1989, about two thirds of ever-divorced 
mothers had child support awards which required that the nonresident father 
pay child support (US Bureau of the Census 1991: Table C). Of these, three 
quarters actually received any child support in the past year. Taken together, 
this means that about half of previously married mothers did not receive any 
child support. The average annual payment among those who received any 
support was $3155.³ Data from the Current Population Survey assume that
when families do not have child support awards, nonresident fathers make no 
financial contributions to their children. However, information from the
1987-1988 National Survey of Families and Households (NSFH) suggests that
among families in which parents were divorced or maritally separated, about 
one in four fathers without awards made informal child support payments to 
the children's mother. Among those who made informal contributions, the 
median annual payment was about $1200 (JA Seltzer, unpublished data). In 
addition to child support, nonresident fathers sometimes pay for clothes for 
children, buy presents for them, pay for medical expenses, or make other 
contributions to children's material welfare. These transfers probably do not 
compensate for the low levels of child support payments. More than half of 
fathers buy presents for their children, but most do not provide other contri- 
butions outside of child support, according to a recent wave of the National 
Longitudinal Survey of the High School Class of 1972. Fathers who are already 
paying child support are more likely to provide extra support, which is contrary 
to the notion that informal transfers from fathers may substitute for payments 
required by child support awards (Teachman 1991). We know very little about 
how much divorced nonresident fathers contribute to children's college ex- 
penses, and divorce settlements infrequently address this issue (Wallerstein & 
Corbin 1986). 
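
 The "about half" figure for mothers who received no child support follows from combining the two shares reported from the 1989 data. The short sketch below reproduces that arithmetic with the rounded fractions given in the text; the function name is a hypothetical label for illustration. Like the Current Population Survey figures, it counts mothers without awards as receiving nothing, so it ignores informal payments.

```python
from fractions import Fraction


def share_without_support(award_share, receipt_share_given_award):
    """Share of mothers receiving no formal child support.

    award_share: share of mothers with a child support award.
    receipt_share_given_award: share of those with awards who received
    any payment in the past year.
    Mothers without awards are assumed to receive nothing.
    """
    share_receiving = award_share * receipt_share_given_award
    return 1 - share_receiving


if __name__ == "__main__":
    # Rounded shares from the text: two thirds with awards, three quarters paid.
    result = share_without_support(Fraction(2, 3), Fraction(3, 4))
    print(f"Share receiving no support: {result}")
```

 Because the inputs are rounded ("about two thirds", "three quarters"), the result should be read as an approximation rather than an exact population share.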

The longer parents have been separated, the less likely nonresident fathers 
are to pay any child support, and the smaller their payments are (Beller & 
Graham 1991, 1993). A father's ability to pay support, including whether he 
is employed (Braver et al 1993, Peters et al 1993), his income (Seltzer 1991a), 
and education (Peters et al 1993) affect his compliance with formal child 
support awards, although neither high nor low income fathers have a strong 
record of compliance (Weitzman 1988). Evidence about whether fathers’ re- 
marriage reduces payments is mixed, possibly due to differences across studies 
in sample definition and the use of statistical controls for the positive associ- 
ation between economic status and fathers’ remarriage (Hill 1992, O’Neill 
1985, Seltzer 1991b). Studies that focus on compliance with formal child 
support awards may paint too bleak a picture of fathers’ behavior. Peters and 
her colleagues (1993) argue that a more reasonable approach takes account of 
whether divorced parents informally modify their child support awards to 
reflect changes in where the children live (e.g. when the children spend more
time in the father’s household than originally planned), shifts in children’s and
parents’ needs, and fluctuations in the father’s ability to pay support. Such
modifications occur outside the legal system, and so are less easily enforced
by the state, but the changes allow divorced parents to reorganize their child-
care arrangements without the expense of lawyers and court fees.

³These estimates are weighted averages combining divorced and remarried mothers. They
overestimate child support for all maritally separated women because they exclude awards and
payments from those who were separated but not formally divorced. Informally separated mothers
are less likely to have awards, and they receive less support than do divorced mothers (US Bureau
of the Census 1991).

 Longitudinal data from California show that between 12% and 24% of
recently divorced families report informal modifications in their child support 
awards. When these informal modifications are taken into account, child sup-
port compliance is slightly higher, 76% compared to 71% for compliance with
formal awards. The effects on compliance of fathers’ unemployment and 
changes in children's residence are attenuated when informal modifications 
are taken into account. Peters and her colleagues interpret the informal ar- 
rangements as self-regulating, and they suggest that informal negotiation oc- 
curs between parents capable of working together for their children's best 
interests. Although the authors do not report amounts of child support paid, 
that compliance with informal arrangements is higher suggests that informal 
modifications require smaller transfers between nonresident fathers and resi- 
dent mothers. Whether this is actually in children's economic interests depends 
on how much the informal reduction in support owed is attributable to in- 
creased time in the father's household when he spends money directly on 
children. 

Because informal changes result from negotiations between a mother and 
father who differ in their economic interests and abilities to achieve these 
interests, informal modifications may be more costly for the weaker parent 
than if negotiations occur in a formal setting with legal protections (Seltzer & 
Garfinkel 1990). A concern with unequal power is relevant for bargaining 
about changes that one parent wants because of a choice he or she has made. 
For instance, a nonresident parent may ask to reduce child support payments 
to pay for a car. Reducing the amount of child support owed when a nonresident 
father loses his job, however, is a likely outcome of either formal or informal 
negotiations. 


METHODOLOGICAL ISSUES AND CHILD SUPPORT Most studies of child support 
focus on families in which children live with mothers and nonresident fathers 
have child support responsibilities. This has been reasonable because, as noted, 
children usually live with their mothers after divorce. However, the growing 
percentage of children who live with their fathers after separation has moti-
vated the US Census Bureau to change its procedures to ask both mothers
and fathers about their child support arrangements. Information from Wiscon- 
sin divorce and paternity cases from the mid-1980s shows that, compared to 
nonresident fathers, nonresident mothers are much less likely to have formal 
child support awards (82% vs. 30%, respectively). Nonresident mothers also
pay less child support than do nonresident fathers (Meyer & Garasky 1993).
That nonresident mothers pay less child support than nonresident fathers re- 
flects women's lower incomes and is consistent with child support guidelines 
that take account of whether the obligor has sufficient income on which to 
live (Garfinkel 1992). Amounts of child support awarded may also depend on 
the resident parent's income. When resident parents are better able to support 
the children on their own, the nonresident parent's child support award may 
be lower (Seltzer & Garfinkel 1990). Children who live with single fathers are 
better off economically than those who live with single mothers because of 
their fathers' higher earnings (Bianchi 1994), and this explains, in part, why 
nonresident mothers have lower child support obligations. The lower payments 
of nonresident mothers also might reflect that children are still unlikely to live 
with their fathers. Exceptions may include a disproportionate number of in- 
stances in which the mother is unable to care for the children due to her poor 
physical or mental health. 

Measurement of child support awards and payments has also relied primarily 
on reports from resident mothers, instead of nonresident fathers, even though 
divorced parents differ in their perceptions and interests in child support 
exchanges. Studies that compare nonresident fathers' reports of payments to 
resident mothers' reports about how much child support they receive show 
that fathers report paying more than mothers say that they receive (Schaeffer 
et al 1991, Braver et al 1991, Sonenstein & Calhoun 1990). Differences in 
parents’ reports are attributable to lower rates of survey participation among 
nonresident fathers than among resident mothers (Seltzer & Brandreth 1994, 
Schaeffer et al 1991, Cherlin et al 1983). Fathers’ reports may also be higher 
because they know more about payments made to a third party such as for 
doctors’ visits, gifts, or visitation expenses, because fathers may give money 
directly to the children, or because fathers overreport payments to cast them- 
selves in a favorable light. 


Persistent Economic Disadvantage: What Can Fathers Do? 


Child support transfers, formal and informal, are too low to alleviate the 
economic hardships of living in a single-mother household. Formal awards are 
less than the cost of raising a child, and many nonresident parents do not 
comply with their awards (Weitzman 1988). Informal modifications in awards 
suggest that when fathers are less able to pay support, their contributions 
decline—and mothers accept these declines. Certainly, ability to pay is central 
to whether or not fathers can help meet children’s economic needs after 
separation. Both simulation studies and those that rely on direct observations 
of changes in nonresident fathers’ incomes as their children age suggest that 
fathers can afford to pay more child support than they currently pay (Hill 1992, 
Garfinkel & Oellerich 1989, Phillips & Garfinkel 1993). Men's earnings in-
crease as they age and acquire additional labor force experience. Even men
who remarry or father additional children have sufficient income to share more 
with their nonresident children, although simulations suggest that a small 
number of divorced fathers cannot afford to support two households (Hill 
1992). Increasing the child support payments of nonresident fathers, however, 
would not eliminate childhood poverty (Hill 1992, Garfinkel & McLanahan 
1986). 

For those who have sufficient income to pay more child support, the key 
question is: Why are their child support transfers so low? Fathers who live 
apart from their children miss many of the benefits of paternity. Almost by 
definition, when fathers leave the household, they spend less time with their 
children. At divorce, the expense of setting up two separate households makes 
both parents feel pinched financially. Fathers face the additional task of
building new ways of staying involved with their children (Fox & Blanton 
1994). This task is complicated by the fact that contact with children, especially 
younger children, must be managed through their mother, and this interaction 
may be emotionally painful for both parents and conflict-ridden (Lund 1987). 
Ambiguity about how divorced fathers should care for their children also limits 
fathers’ ability to participate in childrearing after separation (Seltzer 1991b). 
When fathers have less control over childrearing and are unable to enjoy their 
children, they are less willing to “pay” for them, or contribute child support. 
In addition, fathers who live apart from their children feel uncertain about 
whether their child support contributions directly benefit the children, or 
whether the resident mother, their ex-wife/partner, is keeping the money for 
herself. Nonresident fathers are reluctant to give money to their ex-partners, 
even when the fathers care deeply about their children. Their inability to 
monitor expenditures on children may lower fathers’ child support payments 
(Weiss & Willis 1985). Fathers who feel less in control of their postdivorce 
childrearing arrangements are less likely to pay child support to the children’s 
mother (Braver et al 1993). In-depth interview studies also provide evidence 
of fathers’ concern about the extent to which child support contributions 
actually benefit children as opposed to ex-wives (Sherwood 1992, Arendell 
1992). 

Direct evidence about how divorced mothers spend child support money is 
limited. However, Del Boca & Flinn (1994) use US Consumer Expenditure 
Survey data to show that, compared to income from other sources, mothers 
spend a higher percentage of child support income on goods used only by 
children (e.g. clothes). How to divide expenses for goods such as rent and 
utilities that are shared by children and their parents is the subject of debate 
(see, for example, Lewin/ICF 1990). Other topics for future research include 
whether mothers use child support money to purchase things for themselves, 
how much nonresident fathers know about how child support money is spent, 
and whether fathers' knowledge of childrearing expenses and mothers' expen-
ditures is accurate.


Who Meets Children's Economic Needs if Fathers Do Not? 


At separation, mothers increase their involvement in the paid labor force. Some 
re-enter after not participating for a while; others increase the number of hours 
that they work for pay (Peterson 1989, Bianchi & McArthur 1991). Public 
subsidies also help support children whose parents separate. About 22% of 
mothers and their children receive AFDC a year after separation (Bianchi & 
McArthur 1991), but welfare payments are generally low, and in-kind pro-
grams such as food stamps and Medicaid do not bring children in single-mother
households into parity with children whose parents have not separated (Garfinkel
& McLanahan 1986). Finally, stepfathers are an important source of children's 
income. Incomes of households in which children live with stepfathers are 
similar to those in households with two original parents, according to NSFH 
estimates (Thomson et al 1994). We lack information about whether stepfathers 
share their incomes exactly the way biological fathers share with their co-res- 
ident children. Absence of information about intrahousehold resource alloca- 
tion characterizes most studies of US families and households, despite its 
importance for theories about relationships between generations and about 
power and inequality in relationships between women and men. Children who 
live with stepfathers generally benefit from these men's investments in im- 
proved housing and neighborhood location and from stepfathers' contributions 
to daily needs. It is not clear how much, if at all, stepfathers contribute to 
children's educational expenses after high school or the other costs of leaving 
home and setting up a new household. That remarriages are unstable and often 
last less time than first marriages (Cherlin 1992) suggests that although step- 
fathers may help meet children's material needs in the short term, they do not 
provide long-term relief from the economic hardship of divorce. As noted 
above, maternal grandparents may provide for children when divorced mothers 
move in with them. However, this arrangement is also a short-term solution 
to economic needs (Bumpass & Raley 1993). 


Fathers vs Others as Economic Providers 


Do children whose divorced fathers pay child support benefit from this transfer, 
beyond the direct benefits of having a higher income? This is a very restricted 
version of a question commonly raised about the effects of welfare payments 
on child well-being, where the contrast is between public and private respon- 
sibility for children. The effects of AFDC and other welfare programs on 
children and their families are well beyond the scope of this review, but it is 
important to consider whether dollars that fathers contribute in child support 
are worth more in children's well-being than dollars of income from some 
other source. Several studies show a positive association between the amount
of child support that nonresident fathers pay and children's good behavior 
(Furstenberg et al 1987) and school achievement (Graham et al 1994, King 
1994, Knox 1993, McLanahan et al 1994, Knox & Bane 1994). Child support 
payments also have a larger impact on children's well-being than other house- 
hold income (Beller & Graham 1993, Graham et al 1994, Knox 1993, Knox 
& Bane 1994, McLanahan et al 1994). These studies use data from a wide 
variety of surveys, including the National Survey of Children (NSC), the 
Current Population Survey, the NSFH, the Panel Study of Income Dynamics, 
and the NLSY. A few differences occur across studies in the association 
between payments and child well-being. For example, Furstenberg and his 
colleagues (1987), using data from the NSC, find no effect of payments on 
school performance, and King (1994) and Knox (1994), using NLSY data, 
find no effect on behavior problems. Nevertheless, the generally consistent 
finding that child support payments are associated with positive child outcomes 
suggests reasonably high reliability across different sampling units (children 
vs families), statistical techniques, and measurement of the key constructs of 
child support and child well-being. 

The benefits of child support payments, as distinct from other income, may 
reflect unmeasured differences between families in which fathers do and do 
not pay child support. Child support payments have a symbolic importance to 
children who view these financial contributions as an indication that their father 
cares about them. Clinical evidence suggests that when fathers pay support 
regularly, children are less likely to feel rejected by their fathers (Wallerstein 
& Huntington 1983). Fathers who pay support also see their children more 
frequently (Furstenberg et al 1983, Seltzer et al 1989, Seltzer 1991b, but see 
Veum 1993), and the combination of paying support and spending time with 
children may reassure children about their father's love.⁴ Child support also
may benefit children directly if mothers spend more on children than when 
income comes from other sources (Del Boca & Flinn 1994). Alternatively, 
parents of children in families with regular child support may get along better,
and the lack of conflict may explain both higher support payments and chil-
dren's better adjustment. When fathers pay more child support, children are
less likely to move frequently, and they are more likely to stay in the same
school district (Wallerstein & Huntington 1983). The lack of disruption in their
educational careers may account for their better behavior in school and higher
achievement.

⁴Paying support and visiting may both result from fathers’ commitment to children. They may
also be causally related. Fathers who contribute more child support may spend more time with
children as a way to monitor how the mother spends the child support. Spending time with children
also provides information about children’s material needs, and the information may increase fathers’
financial contributions. Findings from cross-sectional data do not support a causal interpretation,
but cross-sectional surveys have obvious disadvantages for assessing reciprocal effects (Seltzer et
al 1989). A stronger test comes from Veum's (1993) analysis of longitudinal data. He shows that
changes over short periods of time in child support payments and in the amount of time fathers spend
with children are independent. Replicating this result for families who have been separated for a
longer time is important for understanding the association between paying support and visiting
children.

Evidence on whether child support itself explains children's enhanced wel- 
fare is suggestive at best. McLanahan and her colleagues (1994) find that the 
benefits of child support for children's school performance persist when re- 
ported conflict between parents is statistically controlled. The effects of pay- 
ments are also positive, regardless of the amount of time that nonresident 
fathers and children spend together (King 1994, Knox 1993, McLanahan et al 
1994). In addition, some studies use statistical techniques to try to take account 
of unmeasured differences between families, such as the quality of parent-child 
relationships, which may explain both child support payments and children's 
school performance (Knox 1993, McLanahan et al 1994). Support payments
continue to have a positive effect on children's behavior in school and their 
achievement, taking account of unmeasured differences between families. This 
effect is statistically significant at conventional levels in Knox's analysis, but 
not in that conducted by McLanahan and her collaborators. 

Differences between the two studies in the populations sampled, the use of 
parents' reports about conflict in their relationship, the quality of the data used 
to correct for unmeasured effects, and the measurement of children's well- 
being may explain why the effect of support is significant in one study but not 
the other. Both studies use a similar statistical procedure, an instrumental 
variables approach in which predicted child support payments are substituted 
for observed payments to control for unmeasured differences between families. 
This procedure is often sensitive to the choice of variables used in the predic- 
tion equations. Nevertheless, the consistency of the positive effect of child 
support payments on children's well-being across studies, including those that 
use direct and indirect methods of taking account of conflict between parents, 
suggests that more universal and rigorous child support enforcement may 
enhance children's welfare. Understanding how child support payments affect 
children's adjustment to parents' separation and living in a single-parent house- 
hold is critical for an evaluation of federal and state reforms in the child support 
system. 
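
 The instrumental variables logic described above, in which predicted payments replace observed payments, can be sketched with simulated data. The example below is a minimal illustration under stated assumptions and is not the specification used by Knox (1993) or McLanahan et al (1994): the instrument, the unmeasured family trait, the effect sizes, and all function names are hypothetical.

```python
import random


def mean(values):
    return sum(values) / len(values)


def ols_slope_intercept(x, y):
    """Least-squares fit of y on a single regressor x (with intercept)."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sxy / sxx
    return slope, my - slope * mx


def two_stage_least_squares(z, x, y):
    """Stage 1: predict x from instrument z. Stage 2: regress y on predicted x."""
    b1, a1 = ols_slope_intercept(z, x)
    x_hat = [a1 + b1 * zi for zi in z]
    slope, _ = ols_slope_intercept(x_hat, y)
    return slope


def simulate(n=20000, true_effect=0.5, seed=0):
    """Generate synthetic families; every quantity here is invented."""
    rng = random.Random(seed)
    z, x, y = [], [], []
    for _ in range(n):
        u = rng.gauss(0, 1)   # unmeasured family trait, e.g. relationship quality
        zi = rng.gauss(0, 1)  # hypothetical instrument, unrelated to u
        xi = zi + u + rng.gauss(0, 1)                    # child support paid
        yi = true_effect * xi + 2 * u + rng.gauss(0, 1)  # child outcome
        z.append(zi)
        x.append(xi)
        y.append(yi)
    return z, x, y


if __name__ == "__main__":
    z, x, y = simulate()
    naive, _ = ols_slope_intercept(x, y)
    iv = two_stage_least_squares(z, x, y)
    print(f"naive slope: {naive:.2f}, two-stage slope: {iv:.2f}, true effect: 0.50")
```

 In this simulation the naive regression overstates the effect of payments because the unmeasured trait raises both payments and child outcomes, while the two-stage estimate falls close to the true value. The result depends entirely on the instrument being unrelated to the unmeasured trait, which is the reason such estimates are sensitive to the variables chosen for the prediction equation.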


MARITAL DISRUPTION AND CHILDREN'S 
EMOTIONAL NEEDS 


Just as children must be physically safe and have enough to eat, they also 
require emotional sustenance. The process of parents' separation and divorce 
threatens children's emotional security through conflict between parents, loss
of daily contact with one parent, and disruption of routines. Anxiety, depres- 
sion, and disruptive behavior characterize children's short-term adjustment to 
divorce (Chase-Lansdale & Hetherington 1990). Threats to children's emo- 
tional stability may also explain longer-term disadvantages, but the evidence 
on this is limited, due to the methodological demands of considering simulta- 
neously time since separation, child's age at separation, and child's current 
age (Chase-Lansdale & Hetherington 1990). The National Survey of Children 
(NSC), which has followed children from mid childhood to early adulthood, 
provides preliminary evidence that negative effects of parents' separation on 
emotional adjustment persist (Zill et al 1993). Children's developmental stage 
and their social resources, including access to support from teachers and peers, 
affect their emotional response to divorce (Wallerstein & Kelly 1980b, Hether- 
ington et al 1982). Regardless of their age, and whether they are boys or girls, 
children who have a warm and stable relationship with their parents do better 
than children who do not. [See Maccoby & Martin (1983) for a review.] This
section considers briefly the threats to children's emotional security when 
parents divorce, and it addresses the question of how children's emotional 
needs are met after divorce. 


Threats to Children's Emotional Security 


Most US children anticipate growing up in homes with both their mother and 
their father. They view themselves as members of a family defined by who 
lives together, in part, because US culture recognizes the close correspondence 
between family membership and co-residence, and because, for younger chil- 
dren, their cognitive abilities make it difficult for them to understand deviations 
from what they have learned is "the way things are" (McGurk & Glachan 
1987). When a parent leaves the household, children experience a tremendous 
sense of loss and anxiety about who will take care of them (Wallerstein & 
Kelly 1980b). 

Conflict between parents harms children. This finding is well-established, 
although the mechanisms through which conflict affects child well-being are 
less well understood (Emery 1982, Camara & Resnick 1988, Depner et al 
1993). Children whose parents ultimately divorce are exposed to parents' 
marital conflict before the parents decide to separate, during separation as parents
argue about how to reorder their own and their children's lives, and, often, 
after divorce when parents disagree about childrearing matters such as the time 
each parent spends with the child and child support. That this conflict accounts 
for at least some of the emotional disadvantages of children whose parents 
divorce is consistent with evidence from national surveys which shows that 
children whose parents are married but have high-conflict relationships have 
similar adjustment problems to children who live in single-mother households 
(Hanson 1993, Peterson & Zill 1986). 

Conflict between parents affects children directly but also has an indirect 
effect through childrearing practices and the quality of parent-child relation- 
ships. Parents who are preoccupied with their own problems may be inattentive 
to children's needs. Particularly during the trauma of divorce, parents may be 
unable to comfort their children because of their own pain. Parents sometimes 
call on children to reassure them, asking children to bear the responsibility for 
their own and their parents' adjustment to marital separation (Wallerstein & 
Blakeslee 1990). When families are followed as their children grow up, com- 
parisons of children whose parents stay together to those whose parents divorce 
show that differences in parent-child relationships exist well before divorce 
occurs. Compared to parents in families who stayed together, those who 
divorced portrayed themselves and each other as tense, preoccupied, critical 
and unsupportive in the years prior to the divorce (Block et al 1988). This 
evidence, along with the longitudinal studies by Cherlin and his colleagues 
(1991) and by Baydar (1988), suggests that the disruption of children's emo- 
tional security predates the disruption of divorce. 

Finally, the stress of the many changes in household and family relationships 
that accompany divorce threatens children's emotional security. Divorce is 
often accompanied by residential moves, changes in mother's employment and 
childcare arrangements, and disruption of the household’s daily routine. Res- 
ident mothers' childrearing practices, especially discipline and control, become 
more erratic in the year or two following marital separation (Hetherington et 
al 1982). Disruption in the way mothers interact with children also threatens 
children's emotional security and attachment. Children who experience mul- 
tiple transitions in their family arrangements as parents divorce, cohabit and 
remarry, and then separate again may be particularly disadvantaged in their 
emotional and social adjustment (Amato & Booth 1991, Furstenberg & Seltzer 
1986, Wu & Martinson 1993). Stress interpretations focus on short-term in- 
terruptions. To the extent that children, and their mothers, restabilize their 
lives, emotional security may be restored. The disruption of childrearing prac- 
tices, however, may have longer-term effects on socialization, an issue con- 
sidered below. 


Fathers and Children's Emotional Security 


Separation from a parent may have lasting consequences for children's attach- 
ment (Rutter 1981). But when fathers leave the household at divorce, they may 
still be involved in children's lives. Perhaps because of their own emotional 
pain and the difficulty of knowing how to behave with their children after the 
divorce (Lund 1987), many fathers withdraw from children. National surveys 
of families in which children live with their mother after the divorce show that 
a large percentage of children see their fathers infrequently if at all. About one 
in five divorced fathers has not seen his children in the past year, and fewer than 
one out of every two fathers sees his children more than several times a year, 
according to estimates from the NSFH (Seltzer 1991b). The 1981 wave of the 
NSC and the 1981 Child Health Supplement to the National Health Interview 
Survey show even less contact between fathers and children who live apart, 
but these estimates include both children of divorce and those born to unmar- 
ried parents, a group that has less frequent contact (Furstenberg et al 1983, 
Seltzer & Bianchi 1988). The longer divorced fathers and their children have 
lived apart, the less contact they have with each other (Seltzer 1991b). However, 
longitudinal studies of the process of fathers’ disengagement show that this 
aggregate pattern masks fluctuation in patterns of contact as children grow up. 
Furstenberg & Harris (1992) and Hetherington (1993) report that fathers and 
children may reestablish contact with each other during adolescence. Never- 
theless, the overwhelming pattern is for most divorced fathers to stop seeing 
their children, and for those who maintain contact, to reduce the frequency of 
visits. A father’s remarriage also reduces contact with children (Furstenberg 
et al 1983, Seltzer 1991b), perhaps because the father’s new family obligations 
pull him away from his children or because of strains in the relationship 
between his new wife and his children from a previous marriage. 

To the extent that children live with their fathers after divorce, these figures 
overstate fathers’ disengagement from children. As noted above, however, only 
a small percentage of children live with their father after divorce. Mothers are 
more likely to maintain contact with children living with their father than 
fathers are when the children live with their mother (Furstenberg et al 1983, 
Seltzer & Bianchi 1988). However, among families in which the nonresident 
parent has at least some contact with the children, mothers and fathers are 
about equally likely to see the children weekly or more frequently (Seltzer & 
Bianchi 1988). 

Notwithstanding evidence of most fathers’ disengagement, some fathers 
maintain a significant presence in children’s lives after divorce. Nearly a 
quarter of nonresident fathers see their children at least weekly, and more than 
a third of fathers who see their children also spend extended periods of time 
with them in visits lasting longer than a weekend (Seltzer 1991b). Fathers who 
see their children frequently also maintain contact by telephone and mail. 
When nonresident fathers do not see their children, they are unlikely to com- 
pensate for the lack of face-to-face contact by telephone calls (Furstenberg & 
Nord 1985, Seltzer 1991b). As noted above, frequent contact is also associated 
with paying more child support. 

Children describe their relationships with nonresident parents as close, even 
when they have very little contact (Maccoby et al 1993). Many fathers and 
children desperately want to maintain contact (Wallerstein & Blakeslee 1990), 
but evidence of the salutary effects of paternal involvement after divorce is 
mixed. Whereas small studies that emphasize psychological outcomes and the 
quality of parent-child interactions find a positive effect of fathers' involve- 
ment on child well-being (Hess & Camara 1979, Lund 1987, Wallerstein & 
Kelly 1980a), large national surveys consistently show an absence of 
association between nonresident fathers’ visits and children's well-being (King 1994, 
Furstenberg et al 1987, McLanahan et al 1994). Yet other studies such as those 
by Hetherington and her colleagues (1982) and by Amato & Rezac (1993) find 
that the effects of fathers' involvement depend on the quality of parents' 
relationship with each other and with their children. When divorced parents 
get along well with each other, contact with the father benefits children. This 
finding, however, is another that is inconsistent across studies, even when 
investigators use data from the same survey. For instance, Amato & Rezac 
find an effect of conflict on the association between contact and child well- 
being, but Hanson (1993) does not find this interaction. Differences across 
studies may result from alternate definitions of conflict and child outcomes. 

Conflict between parents often persists after divorce, further threatening 
children's emotional security (Johnston et al 1989). Parents share an interest 
in their children's welfare, and so they cannot make the "clean break" that is 
possible for nonparents who end their marriage. Money and childrearing 
practices are major sources of disagreement among married parents (Hanson 
1993), and these issues are even more difficult for divorced parents who 
already have a history of conflict at the same time they face ambiguous role 
responsibilities (Wallerstein & Huntington 1983, Seltzer 1991b). Divorced 
parents may respond to the challenges of postdivorce childrearing by disen- 
gaging from each other and managing their childrearing responsibilities in 
parallel households (Furstenberg & Nord 1985) or by remaining engaged with 
each other, either in reasonable harmony or in hostility. Over time, however, 
disengagement becomes more common (Maccoby et al 1993), especially as 
parents remarry and move to new locations. 

Conflict and lower cooperation between parents make adolescents feel 
caught between their parents, and when children live part of the time with each 
parent, the effect of discord between parents on whether adolescents feel 
caught in the middle is stronger (Buchanan et al 1991). Children who are 
involved in their parents' disagreements or who feel that they must manage 
their parents' relationship to make their lives run more smoothly are more 
likely to be depressed or to exhibit deviant behavior (Buchanan et al 1991). 

Fathers’ adjustment to the short-term crisis of marital dissolution affects 
children's emotional well-being. Nonresident fathers who continue spending 
time with children despite the pain of establishing a visiting relationship may 
combat the feeling of loss that children experience when fathers leave the 
household (Chase-Lansdale & Hetherington 1990). In addition, nonresident 
fathers improve the quality of children's emotional environment by supporting 
the resident mother directly by reinforcing her childrearing practices and 
indirectly by providing emotional support and sharing the responsibility for 
childcare (Hetherington 1993). 


Emotional Security and Father Substitutes 


Grandparents and other kin may step in to alleviate the emotional trauma and 
disruption of divorce for children (Cherlin & Furstenberg 1986). In addition 
to comforting children, other adults provide social support which alleviates 
the resident mother's strain and improves the quality of her interactions with 
the children (Hetherington et al 1982). A stepfather may also enhance chil- 
dren's emotional welfare when a divorced mother remarries by reducing the 
amount of stress and economic insecurity that she experiences. However, a 
new person in the household brings new strains and challenges (Cherlin 1978). 
Compared to original fathers who live with their children, stepfathers are less 
engaged socially and emotionally with their stepchildren (Thomson et al 1992, 
Smith & Morgan 1994). When children have good relationships with their 
stepfathers, this enhances children's emotional adjustment and social and 
academic achievement (Hetherington 1993). Children's contact with nonresident 
fathers declines when mothers remarry (Seltzer & Bianchi 1988, Furstenberg 
et al 1983), and the strains of acquiring a stepfather at the same time that they 
lose ties with their original father may threaten children's emotional welfare. 


MARITAL DISRUPTION AND CHILDREN'S 
SOCIALIZATION NEEDS 


Children depend on adults to look after them and to supervise their behavior. 
Supervision occurs by direct monitoring, as when parents supervise children's 
homework. But the most important aspect of parental supervision is the daily 
socialization of children in which parents teach children explicitly and by 
example how to function in the larger society. Children's close relationship 
with parents facilitates effective socialization (Maccoby & Martin 1983). 
Parents’ behavior, including rules, discipline, communication styles and 
reasoning, also contributes to children's socialization. For example, children whose 
parents separate receive less support, supervision, and encouragement about 
educational activities than do children whose parents stay together, and this 
difference explains, in part, the lower educational attainment of children from 
separated families (Astone & McLanahan 1991). [See McLanahan & Sandefur 
(1994) for a review of the degree to which differences in childrearing practices 
by household structure explain young adults’ well-being.] 

Fathers' involvement with children after separation provides an opportunity 
for them to continue to contribute to children's socialization. As shown in the 
previous section, the vast majority of fathers disengage from children and so 
limit their opportunities to teach children values and practices. Among fathers 
who remain involved with children after divorce, most pursue recreational 
activities with children, such as going to parks or pursuing other amusements 
(Furstenberg & Nord 1985). Only a small proportion of fathers help their 
children with homework or pursue instrumental activities with them (Fursten- 
berg & Nord 1985, Teachman 1991). Compared to divorced mothers, divorced 
fathers are also less effective at monitoring children's behavior and keeping 
track of where the children are and their activities, even taking into account 
that children spend more time with their mother than with their father (Mac- 
coby & Mnookin 1992). In addition, Maccoby & Mnookin find that even when 
children spend time in each parent's household, mothers take care of more of 
the routine tasks of childrearing, such as keeping track of children's medical 
and dental needs, than fathers do. As with children's emotional needs, a 
stepfather’s support may help his new wife take care of her children, but 
stepfathers may be less effective or less interested in supervising stepchildren 
than they are their biological children. The level and quality of stepfathers' 
involvement with children depend on the quality of the mother-stepfather 
relationship. Stepfathers are more effective if they participate in childrearing 
indirectly, by reinforcing the resident mother’s rules, than if the stepfathers 
discipline the children independently or if stepfathers avoid getting involved 
altogether (Hetherington et al 1985). 


CHILDREN, WOMEN—AND MEN 


After divorce, children’s economic, emotional, and social needs are met largely 
through women’s efforts. Mothers support children and manage their daily 
care when they live together. Even when children of divorce live with their 
fathers, or spend time in each parent’s household, mothers spend more time 
with children, organize more childrearing activities than fathers do, and super- 
vise children more effectively than fathers do (Furstenberg & Nord 1985, 
Seltzer & Bianchi 1988, Maccoby & Mnookin 1992). Compared to the quality 
of their relationship with their father, children’s closeness to their mother 
has a stronger, more consistent effect on children’s adjustment after divorce 
(Maccoby et al 1993). 

Not only do mothers care for children after divorce, they also orchestrate 
men’s relationships with children. Mothers’ managerial responsibilities receive 
attention in studies of married parents (Backett 1987). When parents separate, 
mothers’ control over relationships between fathers and children increases. 
Mothers organize children’s activities and may limit the amount of time that 
fathers spend with children as a strategy to increase fathers’ child support 
payments or because mothers are concerned about how well fathers take care 
of the children when they are together. Limits on fathers' time with children 
also reflect the complexity of juggling children's, mothers', and fathers' sched- 
ules when both parents are employed and running their own households. When 
a father remarries, he spends even less time with his children from a previous 
relationship (Furstenberg et al 1983, Seltzer 1991b). The father devotes atten- 
tion to his new wife, and, often, her children (Furstenberg & Cherlin 1992), 
but here again his relationships with children are moderated by a woman (e.g. 
Marsiglio 1992, Seltzer & Brandreth 1994). 

Under some conditions, a father may be able to establish a relationship with 
children independent of his ties to their mother. In co-resident families, a father 
who is alone with children for extended periods of time without the children's 
mother develops routines and takes responsibility for children (Gullestad 
1984). However, mothers must relinquish some of their authority over 
childrearing for this to occur, and mothers are more likely to do so when structural 
constraints, such as the requirements of shift work, force them to. 
Divorce, when parents set up separate households, is another structural change 
that increases the percentage of time a divorced father and his children spend 
alone together, away from the children's mother. Divorced fathers who gain 
experience interacting directly with children are more likely to stay involved 
with their children. However, divorced fathers face a tremendous challenge in 
establishing these ties, particularly in the period immediately following sepa- 
ration. Not only must a divorced father learn how to care for children on his 
own, but he must do this at a time when he and his children are under great 
emotional strain and in an unfamiliar setting (i.e. in a new household). Unfor- 
tunately, we know little about the conditions that enable divorced fathers to 
overcome these obstacles. 

Mothers are reluctant to relinquish authority over children even when they 
would be relinquishing it to the children's fathers. Mothers greatly fear losing 
their children in disagreements about custody (Weitzman 1985). For many 
women, divorce is a step toward autonomy and greater independence from 
men. Even formalizing fathers’ decision-making rights through joint legal 
custody threatens mothers' autonomy (e.g. when joint custody limits mothers' 
ability to move to a new state). At the same time, a high percentage of mothers 
say that they would like the nonresident father to play a larger role in raising 
their children (Furstenberg 1988). A mother may want the nonresident father 
to be involved in childrearing on her own terms, while the father also wants 
to control the situation. Men control more economic and social resources than 
women do, and this contributes to mothers' anxiety about managing child- 
rearing after divorce jointly with children's fathers. By facilitating divorced 
fathers’ independent involvement with children, divorced mothers may fear 
that they will lose many of their own decision-making rights because of fathers’ 
greater relative power. Future research should investigate within families the 
effects of mothers’ and fathers’ relative power on the way they manage 
childrearing after divorce. 

Without the formal and emotional benefits of paternity, divorced fathers 
have few incentives to stay involved with their children. Fathers respond to 
the loss of the benefits of paternity by withdrawing from their children at the 
same time that the fathers withdraw from (and are sometimes excluded by) 
their former wives. Instead, fathers devote themselves to new children acquired 
through cohabitation or remarriage. This pattern at the individual level con- 
tributes to a macro-level change in the structure of kinship, which combines 
marriage and childrearing for men, but separates marriage and childrearing for 
women. Because marriage is a short-term institution, men's relationships with 
children are also short term. While they are married to (or live with) the mothers 
of their children, men provide for the economic, emotional, and social needs 
of their biological children, but when the parents’ relationship dissolves, men 
typically withdraw from the social role of father until they establish a new 
marriage and acquire new children. Mothers, on the other hand, stay with their 
children but may acquire social fathers for them through remarriage or cohab- 
itation. 

That fatherhood has become for some a short-term commitment poses a 
serious challenge to studies of the process of disengagement and of how 
parents' relative power affects the resolution of childrearing disagreements. 
Fathers who disengage from their children are difficult to interview, primarily 
because they are difficult to locate (Lin & Schaeffer 1993, Maccoby & Mnoo- 
kin 1992, Braver & Bay 1992). Fathers who cannot be located even in the 
early period after parents have petitioned for divorce are less involved with 
their children (Schaeffer et al 1991) and may have a more highly conflicted 
relationship with the children's mother than fathers who can be located. Studies 
of parents’ relative power and the sources of fathers’ disengagement require 
information from both parents, and the loss of fathers who withdraw earliest 
from their children introduces severe selection biases in studies of postdivorce 
family relationships. One way to address this problem is to follow men and their 
children longitudinally, rather than to sample from divorce court records. 
Men might be sampled at hospitals when their children are born, and research- 
ers could study young men's fertility and relationships with children using 
data from existing surveys, such as the National Longitudinal Survey of Youth. 
Improved understanding of the institutional change in marriage and parenthood 
also requires that future research take account of rising rates of cohabitation 
and childbearing outside of marriage in studies of parent-child separation. 

In two-parent, original families, mothers’, fathers’, and children's interests 
sometimes diverge. Divorce exacerbates these differences as parents set up 
separate households and reallocate control of joint assets. Although children 
are parents’ most precious resource, the decision to separate is sometimes 
against the children's best interests. Because children depend on adults for 
physical and emotional care, divorce threatens children's well-being. They 
experience the loss of a parent's income, disruption of their daily routines, 
conflict between their parents, and reduced contact with a parent. As a result, 
children whose parents divorce suffer social and emotional disadvantages 
compared to children who live with two parents. Lone parents, usually mothers, 
may share responsibility for children with stepparents, grandparents, and the 
state (e.g. through welfare), but this assistance often does not reduce the 
disparity between children in one- and two-parent households. For children's 
sake, single parents need more institutional support, particularly to alleviate 
the economic disadvantages of divorce and the related problem of providing 
adequate childcare when mothers are employed. The challenges we must 
address are how to provide this support and how to divide responsibility for 
children's needs among mothers, fathers, and the state. 


Abstract 


Sociobiology is the study of the biological bases of behavior in the context of 
neo-Darwinian evolutionary theory. This review organizes the contribution of 
sociobiology to the understanding of human behavior around a small number 
of major theoretical articulations. These include identification of the gene as 
the basic unit of selection; the concept of inclusive fitness and the role of 
relatedness; models of reciprocal altruism and the concept of evolutionarily 
stable strategy (ESS); and the theory of optimal reproductive investment. 
Sociobiological and evolutionary thinking is expected to affect sociology in a 
number of areas including sex and gender roles; the theory of collective action; 
and the elaboration of a richer and more complex model of human nature. 


INTRODUCTION 


Sociobiology has been defined by Wilson (1975:4) as "the systematic study 
of the biological basis of all social behavior." Specifically, sociobiology fo- 
cuses on evolutionary explanations of behavior within the context of the 
modern synthesis, or neo-Darwinian evolutionary theory, more than it does on 
proximate causes of behavior. Sociobiology also refers to the collective enter- 
prise, an "alliance of disciplines" (Lopreato 1992), that emerged to public 
consciousness in the mid-1970s with two influential statements: Wilson's 
(1975) monumental Sociobiology: The New Synthesis, and Dawkins’ (1989 
[1976]) The Selfish Gene, although the revival of Darwinism in social thought 
had begun substantially earlier (Degler 1991:215-44). Dawkins did not use 
the term "sociobiology," which was chosen by Wilson to denote the emerging 
discipline, and many captivated readers of Dawkins’ book did not realize they were 
exposed to the same ideas which, as presented by Wilson, would generate a 
flare of controversy. 

The controversy, born of the fear of political implications, was to last several 
years (Sahlins 1976, Gould 1981, Lewontin et al 1984, Kitcher 1985). The 
controversial phase seems now to have abated, and sociobiological ideas have 
begun to make substantial inroads in anthropology and psychology (see Max- 
well 1991, for influences in other fields, and Fisher 1991). Sociology has been 
more resistant, with much of the field still influenced by an antibiological and 
antievolutionary outlook (Greenwood 1984, Dietz et al 1990, van den Berghe 
1974, 1990, Sanderson & Ellis 1992). Economics is a case apart, with a claim 
for having influenced sociobiology in the other direction by providing it with 
many of its formal analytical tools, such as optimization and game-theoretic 
models (Hirshleifer 1977, 1978a,b, 1985, Boyd & Richerson 1985:157-66, 
Smith 1992:21). While I use the term “sociobiology” in this review for con- 
venience, the evolution of behavior has been studied under different names, 
such as behavioral or evolutionary ecology, biocultural or biosocial science, 
and Darwinian or evolutionary psychology (see Cronk 1991), from a desire to 
establish distance from the debate surrounding Wilson (1975, 1978) or to 
express a particular theoretical outlook more precisely. A number of excellent 
textbooks on sociobiology are now available, including Wilson (1980), Barash 
(1982), Daly & Wilson (1983), Trivers (1985), Badcock (1991), and Krebs & 
Davies (1993). 

This review builds on the premise that the contribution of sociobiology to 
the understanding of human behavior can be organized around a small number 
of major theoretical articulations. These include identification of the gene as 
the basic unit of selection and of selfishness; the concept of inclusive fitness 
and the role of relatedness; models of reciprocal altruism and the concept of 
evolutionarily stable strategy; and the theory of optimal reproductive invest- 
ment, or effort (see Goldsmith 1991, for a similar approach). Here I review 
these major themes and illustrate the type of research that they have inspired. 
I then discuss briefly recent findings on sexual differentiation during human 
development and other research on proximate physiological causes of behav- 
ior, and attempts to integrate biological and cultural evolution. Finally, I 
conclude by arguing that sociobiological thinking is about to affect sociology 
in at least three major areas: theories of sex and gender; the paradox of 
collective action; and conceptions of human nature, or the psychological 
foundations of social behavior. The last topic is examined in the context of the 
current debate between “Darwinian anthropological” and “Darwinian psycho- 
logical” approaches to evolutionary explanations of human behavior (Symons 
1989). 


LEVELS OF SELECTION 


A fundamental insight of sociobiology is the clarification of the levels (gene, 
individual organism, or group) at which evolution by natural selection operates. 
The emphasis on the “selfish gene” is often associated with Dawkins (1989 
[1976]:32), who identifies the gene as the fundamental unit of selection and, 
consequently, the basic unit of selfishness. In the gene-centered view the 
individual organism becomes a temporary collection of genes that does not 
directly reproduce itself, a “survival machine" or “lumbering robot" (an ex- 
pression of Dawkins that so perturbed critics) constructed by genes to enhance 
their transmission to the next generation. It is interesting that Wilson's (1975:3) 
view is identical with Dawkins’: "In a Darwinian sense the organism does not 
live for itself. Its primary function is not even to reproduce other organisms; 
it reproduces genes, and it serves as their temporary carrier [...] the organism 
is only DNA's way of making more DNA [...] the hypothalamus and limbic 
system are engineered to perpetuate DNA." 

Even though the individual organism has lost its theoretical centrality as the 
unit of evolution and selfishness, it retains its privileged position as the main 
empirical focus of study, because it is after all the behavior of individual 
organisms that constitutes the original phenomenon of interest. In many cases 
where behaviors that benefit the genes are the same as those that benefit the 
individual, there is no difference between gene-level and individual-level se- 
lection. The evolutionary explanation can be phrased more intuitively by 
describing individual behavior as maximizing individual reproductive success. 
As is discussed later, an essential contribution of the gene-centered view of 
evolution is that it permits a natural explanation of altruistic behavior toward 
relatives, a case in which the "interests" of the genes and of the individual 
carrying them do not necessarily coincide. 

Even more controversial than the shift of the spotlight from the individual 
organism to the gene was the debunking of the group as the unit of selection. 
In the decades preceding the formal emergence of sociobiology, the under- 
standing of evolution by a large part of the public, as well as many biologists, 
rested on the idea that behaviors evolve “for the good of the species," because 
they benefit the survival of the group as a whole. This biological version of 
functionalism was aided by influential popular books, such as Ardrey's (1970). 
Group selectionism, the idea that entire groups, or more precisely local pop- 
ulations or demes, are subjected to natural selection, culminated in the elaborate 
presentation of Wynne-Edwards (1962). Wynne-Edwards argued that those 
groups will survive disproportionately in which individuals evolve a 
self-restraining behavior that benefits the group as a whole at the expense of 
individuals engaging in it, with the altruistic behavior eventually becoming fixed 
in a population. The mechanisms postulated by Wynne-Edwards are 
theoretically possible, he brought to bear an overwhelming mass of empirical evidence 
on the theory, and his presentation was so convincing that at least one major 
figure of sociobiology had to fight a personal intellectual struggle to dissuade 
himself of the belief in group selection (Trivers 1985:79-84). 

The rejection of group selectionism took place on two fronts, empirical and 
theoretical. On the empirical front, Lack (1966), for example, demonstrated 
that one of Wynne-Edwards' prime examples of altruistic self-restraint for the 
good of the species, the tendency of birds to lay a limited number of eggs, 
could be explained entirely as the result of individual-level selection. Lack 
showed that parents that laid the species-typical number of eggs produced more 
viable hatchlings than parents laying either more or fewer eggs. Thus the 
species-typical clutch size can be viewed as optimal for the reproductive 
success of the parents themselves, and appeal to altruistic self-restraint and 
group selection is unnecessary. (Similar mechanisms of optimal spacing of 
births have later been shown for human foragers such as the !Kung: see Blurton 
Jones 1989. Lack's original hypothesis had to be qualified in later research: 
Krebs & Davies 1991.) On the theoretical front, evolutionary geneticists ex- 
plored the properties of various mathematical models of group selection, com- 
ing to the conclusion that, even though group selection is theoretically possible, 
its existence depends on a combination of extreme circumstances rarely found 
in nature, so that group selection of altruistic traits may be regarded as nearly
impossible in most cases (Williams 1966, 1975, Boorman & Levitt
1980). 

The core of the theoretical case against group selection is explained by 
Maynard Smith (1989:178). By definition, an altruistic trait is one that benefits 
the survival of the deme, or local population, at a cost, measured as decreased 
reproductive fitness, to the individual who exhibits the trait. As shown in Figure 
1, the possibility of group selection rests on a comparison of several transition 
rates. As demes A composed of altruists have a lower extinction rate than 
demes S of selfish individuals, extinction rate AE is lower than rate SE. But 
since selfish individuals have greater fitness individually in competition with 
altruists, a group of altruists is sensitive to invasion by selfish individuals (by 
immigration or mutation), who would then drive out the altruistic gene from 
the local deme. There is therefore a transition rate AS, which is a function of 
migration of individuals between local environments and the relative advantage 
of selfish individuals in competition with altruists. Group selection of an 
altruistic trait therefore depends on the selection rate of groups (i.e. the differ- 
ence between rates AE and SE) being greater than the combined effect of 
individual selection for the selfish trait and the migration rate. Such conditions 
may be met in nature in some rare cases. An example of group selection may 
be the evolution of less virulent strains of the myxoma virus (a disease of 
rabbits) in Australia. Lower virulence of the virus is interpreted as "altruistic"

Figure 1 Model of group selection. Local environments (resource patches) can be in one of three
states: A, containing a population of altruists; S, containing a population of selfish individuals; E,
empty. Arrows correspond to transitions between states. The benefit of altruism to the group entails
that transition rate AE is lower than rate SE. But a local population of altruists can be converted to
an S patch by a selfish immigrant, since altruists lose in individual competition with selfish
individuals. Source: Maynard Smith (1989:178, Figure 9.7).


behavior that benefits the local population of viruses within the host rabbit by 
delaying killing of the host. In such cases the rate of migration (reinfection) 
is low, and the group selection rate high, making group selection possible. It 
is important to note that the rejection of group selection is based on quantitative 
grounds (rarity of necessary conditions), not issues of principle. 
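
The verbal condition stated above can be written as a simple inequality. The following is a minimal sketch, not Maynard Smith's own formulation: the function name and the numerical rates are hypothetical, and treating the "combined effect of individual selection and migration" as the single transition rate AS of Figure 1 is a simplifying assumption.

```python
def group_selection_possible(rate_ae: float, rate_se: float, rate_as: float) -> bool:
    """Check the qualitative condition for group selection of altruism.

    rate_ae: extinction rate of altruist demes (A -> E)
    rate_se: extinction rate of selfish demes (S -> E)
    rate_as: rate at which altruist demes are converted to selfish ones (A -> S),
             assumed here to summarize migration plus individual selection
    """
    group_selection_rate = rate_se - rate_ae  # advantage of altruist demes
    return group_selection_rate > rate_as


# Hypothetical numbers: low migration (as suggested for the myxoma case)
# versus high migration with only a small group-level advantage.
low_migration = group_selection_possible(rate_ae=0.1, rate_se=0.6, rate_as=0.05)
high_migration = group_selection_possible(rate_ae=0.1, rate_se=0.2, rate_as=0.5)
```

With these illustrative values the condition holds only in the low-migration case, which is the sense in which the rejection of group selection is quantitative rather than a matter of principle.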

Group selectionism may be viewed as a biological equivalent of function- 
alism in the social sciences. The demise of group selectionism parallels the 
current lack of popularity of functionalist explanations in sociology. The socio- 
biological emphasis on selection at levels below the group (individuals or 
genes) is consistent with the methodological individualism inherent in rational 
choice models of behavior that are currently dominant in the social sciences 
(Badcock 1986, Coleman 1990). Smith & Winterhalder (1992b:49), for exam- 
ple, argue that evolutionary and rational choice explanations are often com- 
patible, with the proviso that "natural selection is needed to underwrite a theory 
of preference formation" (see also Hirshleifer 1977, 1987a,b). Sociobiological 
and rational choice explanations are not equivalent, however, since evolution 
can produce behaviors, such as those related to altruism discussed below, that 
seem "irrational" from the point of view of maximizing individual utility. 

The rejection of group selection leaves the explanation of altruism as the
central problem of sociobiology (Wilson 1975:3, Lopreato 1984, 1992). Altruistic
behavior, as it cannot originate from group selection or as a conspiracy
of individual organisms, must have evolved in one of three ways: (i) it is only
the appearance of altruism, the result of individual-level selection, e.g. the 
optimal clutch size of Lack (1966); (ii) individual-level altruism benefiting 
relatives is gene-level selfishness, as in Hamilton's (1964) theory of relatedness 
and inclusive fitness (sometimes called "kin selection"); (iii) apparent altruism 
and cooperation are stable equilibrium combinations of various behavioral 
strategies in the population, the "evolutionarily stable strategies" (ESS) of
Maynard Smith (1989:126-36). The last two possibilities have important implications
for the central role of kinship in human societies, and for the
evolution of cooperation; these are addressed in the next two sections. 


RELATEDNESS AND INCLUSIVE FITNESS 


In a pair of classical papers, Hamilton (1964) laid the foundations of the theory 
of inclusive fitness and the selection of altruistic behavior benefiting relatives. 
The principle of the theory is simple: an individual can transmit its genes to 
the next generations in two basic ways, by having its own offspring or by 
benefiting the reproduction of relatives who share genes by common descent. 
Then, the benefit of an act in terms of reproductive success for the individual 
is a function of its cost to the individual and the benefit conferred on the 
relatives, weighted by the degree of relatedness (i.e. the proportion of genes 
shared with the relative by common descent). The overall success of an indi- 
vidual in reproducing its genes directly through its offspring or indirectly by 
helping relatives is called its inclusive fitness. This basic insight, while some- 
what awkward within a perspective that views the individual as the unit of 
evolution, is perfectly straightforward within the gene-centered understanding 
of evolution that Hamilton was helping to create. 
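
The accounting described in this paragraph is usually summarized as Hamilton's rule: an altruistic act is favored when the benefits to relatives, each weighted by relatedness, exceed the cost to the actor. The sketch below illustrates that bookkeeping only; the function name and all numerical values are hypothetical, and costs and benefits are assumed to be measured in the same units of reproductive success.

```python
def inclusive_fitness_effect(cost, weighted_benefits):
    """Net inclusive-fitness effect of an act.

    cost: fitness cost to the actor
    weighted_benefits: iterable of (relatedness, benefit) pairs, one per recipient
    """
    return sum(r * b for r, b in weighted_benefits) - cost


# Hypothetical act costing 1 unit and giving 1.5 units to each of two recipients.
helping_full_siblings = inclusive_fitness_effect(1.0, [(0.5, 1.5), (0.5, 1.5)])
helping_first_cousins = inclusive_fitness_effect(1.0, [(0.125, 1.5), (0.125, 1.5)])
```

Under these assumed values the act has a positive net effect when directed at full siblings and a negative one when directed at first cousins, which is the gradient of relatedness the theory predicts.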

Hamilton's approach was spectacularly successful in explaining extreme 
cases of altruism associated with eusociality, as found in the societies of insects
such as bees and ants. In eusocial species there is a reproductive division of
labor in which individuals (the workers) do not reproduce themselves but work 
instead on behalf of individuals engaged in reproduction (the queen). The 
workers routinely sacrifice themselves for the greater good of the community. 
It turns out that bees and other Hymenoptera have a particular genetic system
(called haplodiploidy) in which males have only one set of chromosomes, with 
the result that a female bee is more closely related to her sisters than to her 
own offspring. From the point of view of the female worker, it is more 
advantageous for her inclusive fitness to help her mother (the queen) produce 
more sisters than it is to produce her own offspring (Daly & Wilson 1983:48- 
51). While Hamilton's (1964) papers have sometimes been mistakenly interpreted
as restricted to the study of haplodiploid species (see Dawkins 1989),
the theory was meant to apply to diploid species such as mammals as well. 
One of the momentous empirical findings of sociobiology was the discovery 
of eusociality (including sterility of the “worker” caste) in the naked mole rats, 
a species of mammals (Jarvis 1981, Dawkins 1989:313-17). The finding con- 
firmed that eusociality could evolve without haplodiploidy, and in a species 
phylogenetically much closer to humans. 
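
The haplodiploid arithmetic behind the claim that a female is more closely related to her sisters than to her own offspring can be made explicit. This is an illustrative sketch under a stated assumption that is not in the text above: the queen is taken to have mated with a single male, so that all sisters share the same father. The function name is hypothetical.

```python
def sister_relatedness(paternal_sharing, maternal_sharing):
    """Relatedness between full sisters.

    Half of a female's genes come from each parent; each argument is the
    probability that a gene inherited from that parent is also carried by
    her sister.
    """
    return 0.5 * paternal_sharing + 0.5 * maternal_sharing


# Haplodiploidy: the father is haploid, so every daughter receives his
# entire genome (sharing = 1.0); the diploid mother passes on half (0.5).
haplodiploid_sisters = sister_relatedness(1.0, 0.5)

# Ordinary diploid species: each parent passes on half of its genes.
diploid_sisters = sister_relatedness(0.5, 0.5)

# A mother's relatedness to her own offspring is one half in both systems.
own_offspring = 0.5
```

Under the single-mating assumption, sisters in a haplodiploid species are related by three quarters, which exceeds the one half linking a female to her own offspring.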

Hamilton's inclusive fitness theory predicts that milder forms of nepotistic 
altruism among animals will be structured along a gradient of relatedness. Such 
patterns have been well documented in ground squirrels (Sherman 1977, 1980), 
for example, and of course in our own species in a wide variety of forms.
Considerations of inclusive fitness may be viewed as the cornerstone of human
family systems (van den Berghe & Barash 1977, van den Berghe 1978, 1979).
The propensity of individual humans to favor relatives over unrelated strangers 
is so conspicuous as to be taken for granted. Beckstrom (1985, 1989, 1993) 
investigates how legal theory and practice are pervaded by considerations of
relatedness among individuals, and Judge & Hrdy (1992) show the importance 
of relatedness in explaining inheritance patterns in contemporary society. 
Among the most surprising applications of inclusive fitness theory is the avuncular
relationship in some societies, where the maternal uncle plays a greater role in
raising a child than the husband of the mother, the presumed biological father. As
a man has about 50% of his genes in common with his biological child and only
25% in common with a niece or nephew, the avuncular relationship seems to run
counter to the logic of relatedness. The validity of this calculation, however, depends
on the certainty of paternity of the child. In situations where paternity is uncertain,
it may be biologically more advantageous for a man to devote efforts to raising
nieces and nephews, who with great certainty are the offspring of his sister, than
a presumed son or daughter of more uncertain relatedness. One would therefore
expect a greater prevalence of the avuncular relationship in societies with a high
incidence of adultery and consequently children of more uncertain paternity, a
pattern that is found empirically (Alexander 1974, Kurland 1979, van den
Berghe 1979, Dickemann 1981, Hartung 1985). 
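
The logic of the avuncular case can be illustrated with a deliberately simplified calculation. The sketch assumes that the man and his sister are full siblings (so that her child carries 25% of his genes) and that paternity certainty is a single probability; a fuller treatment would also discount the sibling relationship itself when paternity is uncertain. Function and variable names are hypothetical.

```python
RELATEDNESS_TO_SISTERS_CHILD = 0.25  # assumes a full sister; maternity is certain


def relatedness_to_wifes_child(paternity_certainty):
    """Expected relatedness to a wife's child, given the probability
    that the husband is in fact the biological father."""
    return 0.5 * paternity_certainty


def favors_sisters_child(paternity_certainty):
    """True when investing in a sister's child yields the higher expected relatedness."""
    return RELATEDNESS_TO_SISTERS_CHILD > relatedness_to_wifes_child(paternity_certainty)
```

In this simplified version the balance tips toward nieces and nephews only when paternity certainty falls below one half, for example `favors_sisters_child(0.4)` is true while `favors_sisters_child(0.9)` is false.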

Mechanisms of kin recognition, the ability of an individual to assess the 
genetic relatedness of strangers, have been found in many species and are 
interpreted as permitting preferential treatment of relatives (see e.g. Trivers 
1985:129-35). The empirical research by Rushton et al (1984) and Rushton
(1989) suggests that humans have the ability to detect and choose preferentially 
individuals genetically similar to themselves (even among nonrelatives). Van 
den Berghe (1981) explains ethnic solidarity as an extension of the profound 
human propensity to favor close kin over strangers, with cultural markers 
serving as clues for genetic relatedness. Lopreato (1984) likewise identifies a 
generalized propensity to homologous affiliations in humans. Crippen & Machalek
(1989) view kin-recognition mechanisms, together with mechanisms of
deception and self-deception, as the basis of religion, as well as national or 
ethnic identity. They view religious (or national or ethnic) symbols as fictive 
or “perverted” kin-recognition clues destined to elicit group-directed altruism 
on the part of believers (see also Crippen 1988, Reynolds et al 1987, Reynolds 
& Tanner 1993). 

Among issues of relatedness and kinship, the topic of the incest taboo has 
been a prominent occasion of direct confrontation between sociobiological and 
nonbiological social scientific explanations. The thrust of the sociobiological 
explanation of the universality of the taboo is that it serves the evolutionary 
function of reducing the inbreeding depression, the increased risk of congenital 
defects in the offspring of relatives who each carries a recessive deleterious 
gene. The proximate mechanism of the taboo is an aversion to sexual activity 
with individuals with whom one is raised in early childhood, a hypothesis 
originally due to Westermark (1891). 

There are now three well-documented natural experiments where the
Westermark hypothesis has been evaluated (Durham 1991:286-360, Degler 1991:245-69).
The best known case is the study of Israeli kibbutzim by Shepher
(1983), who shows that unrelated children raised together in communal groups 
do not marry or engage in sexual relationships later in life, despite the entire 
absence of cultural opposition to, and even parental preference for, such 
matches. The aversion is explained as an unintended consequence of the close 
proximity in childhood (especially before the age of six). Two other cases, the 
greater failure rate of minor marriages in Taiwan around the beginning of this 
century, where families adopted a little girl and raised her as their own as a 
future wife for their son (Wolf & Huang 1980), and the much higher divorce 
rate for preferential marriages between patrilineal parallel cousins raised in 
close proximity in Lebanon (McCabe 1983), are both interpreted as the out- 
come of the same aversion generated by proximity in early childhood. Durham 
(1991:286-360) carefully tests alternative theories in his review of the evidence
and finds support for a theory of optimal outbreeding, illustrating the enhance- 
ment of a biological predisposition by cultural mechanisms. From the socio- 
biological perspective, traditional social scientific explanations of the taboo, 
in terms of a reduction in confusion of roles within the family, or the benefits 
of the network of reciprocal ties created by the obligation of exogamy, are 
derived and secondary developments, not the primary explanation (van den 
Berghe 1980, 1983, Shepher 1983, Thornhill 1990). 


AGGRESSION, COOPERATION AND SYMBIOSIS: 
EVOLUTIONARILY STABLE STRATEGIES 


The concept of inclusive fitness provides explanations of altruistic behavior 
when protagonists share genes by common descent. The dismissal of group

(1) General case
    W0 = fitness without interaction
    c = cost of cooperation
    b = benefit conferred
    s = synergistic effect

             D           C
    D        W0          W0 + b
    C        W0 - c      W0 + b - c + s

(2) Additive case (Prisoner's dilemma): W0 = 2, b = 2, c = 1, s = 0

             D     C
    D        2     4
    C        1     3

(3) Synergistic case: W0 = 2, b = 2, c = 1, s = 2

             D     C
    D        2     4
    C        1     5

(4) Iterated additive case (Iterated Prisoner's dilemma: case (2) repeated 10 times)

             D     TFT
    D        20    22
    TFT      19    30


Figure 2 Models of cooperation. Table entries are the fitness of a strategy for the individual in the row,
given the strategy adopted by the individual in the column. The strategies are C, cooperate; D, defect; TFT,
Tit for Tat: cooperate at first and then repeat the other's move. The general model is shown in (1).
In the Prisoner's Dilemma (2), the dominant evolutionary strategy is defection, D. In the synergistic
case (3), cooperation C is dominant (and an ESS) and will evolve by natural selection operating on
individuals. The case (4) of the repeated Prisoner's Dilemma has a structure similar to the synergistic
case (3): here too TFT is an ESS once evolved. Source: Modified from Maynard Smith (1989:168-9,
Figure 9.3, Table 9.1).


selection left sociobiology without a theory to explain altruism and cooperation 
among unrelated individuals. Biology and the social sciences, however, offer 
numerous examples of such cooperative behavior, including coalition forma- 
tion among baboons, collaboration among males in a pride of lions, food 
sharing among vampire bats (Wilkinson 1984, 1990), intriguing cases of al- 
truism among dolphins and other cetaceans, sometimes directed at humans, 
and the famous case of tacit cooperation in the “live and let live" system during 
trench warfare in World War I, analyzed by Axelrod (1984). To explain such 
behaviors Trivers (1971) proposed a model of the evolution of reciprocal 
altruism (or contingent reciprocity). Trivers’ model was later interpreted by 
Maynard Smith in the context of a very important conceptual development in 
sociobiology, the notion of evolutionarily stable strategies (ESS). 


Maynard Smith (1989:164-84) discusses the conditions under which cooperation
can evolve, using models based on game theory. Figure 2 illustrates
models of interactions between two individual organisms who can choose to 
cooperate (C) or to defect (D). The entries in each table are interpreted as the 
fitness of an individual adopting the row strategy if the protagonist adopts the 
column strategy. Model 1 in Figure 2 shows the general model of cooperative 
interactions. Each individual has a baseline fitness W0. Cooperation confers a
fitness benefit b on each protagonist, but entails a cost c. In some situations, 
there may be an additional synergistic benefit s to cooperation, as in the case 
of males cooperating in a pride of lions, where the benefit of cooperation is 
greater than additive. Model 2 (in which s = 0) is the classical prisoner's 
dilemma. The well-known result is that in this case, no matter what the 
protagonist does (i.e. C or D), it pays the individual to defect. Cooperation 
will not evolve under such conditions. Model 3 shows the synergistic case, 
where the pay-off of cooperation is greater than the benefit of defection. In 
that case cooperation is so advantageous that it can evolve by individual 
selection alone. 
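
The pay-offs of Models 1 to 3 follow directly from the four parameters of the general model. The sketch below builds the matrix and checks whether a strategy, once common, resists a rare alternative; the function names are hypothetical, and the stability test is the simple strict criterion (a strategy earns more against itself than any alternative earns against it), which is only one part of the full ESS definition.

```python
def payoff_matrix(w0, b, c, s=0):
    """Fitness of the row strategy against the column strategy,
    keyed as (row, column), following the general model of Figure 2."""
    return {
        ("D", "D"): w0,
        ("D", "C"): w0 + b,
        ("C", "D"): w0 - c,
        ("C", "C"): w0 + b - c + s,
    }


def resists_invasion(matrix, resident, strategies=("C", "D")):
    """True if the resident strategy does strictly better against itself
    than every alternative strategy does against it."""
    return all(
        matrix[(resident, resident)] > matrix[(alt, resident)]
        for alt in strategies
        if alt != resident
    )


additive = payoff_matrix(w0=2, b=2, c=1, s=0)     # Model 2, prisoner's dilemma
synergistic = payoff_matrix(w0=2, b=2, c=1, s=2)  # Model 3
```

With the values used in Figure 2, a population of cooperators can be invaded by defectors in the additive case (3 against 4) but not in the synergistic case (5 against 4).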

Even when the pay-off matrix resembles that of the prisoner's dilemma, as 
in Model 2, cooperation can evolve in repeated games using the strategy called 
“Tit for Tat" (TFT), which consists of cooperating at first and then simply 
repeating the previous move, cooperate or defect, of the protagonist. It can be 
shown that TFT, once it evolves, is (almost, see Dawkins, 1989:216-33) an 
evolutionarily stable strategy, or ESS. An ESS is a strategy such that, if almost 
all individuals in a group use it, no alternative strategy can invade the popu- 
lation. In the repeated games of Model 4, a single Defector interacting with a 
population of TFTs will have lower fitness, and the strategy will not spread 
in that population. In a famous series of tournaments, Axelrod (1984) pitted 
against each other computer programs representing different strategies in the
iterated prisoner's dilemma and showed the remarkable robustness of TFT 
against a wide variety of strategies. (See also Axelrod & Dion 1988; for a 
critical discussion of the status of TFT as an ESS, see Boyd & Lorberbaum
1987; see Boyd & Richerson 1990, 1992, Mesterton-Gibbons & Dugatkin
1992, and Nowak & Sigmund 1993, for recent developments; implications
for economic theory are discussed in Hirshleifer 1978b, 1982, and Hirshleifer 
& Martinez Coll 1988.) 
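
The entries of Model 4 can be reproduced by playing the additive game of Model 2 ten times. The following is a minimal sketch of such a repeated game, not Axelrod's tournament code; the function names and the representation of a strategy as a function of the opponent's past moves are assumptions made for illustration.

```python
# Single-round pay-offs of Model 2: (own move, other's move) -> own fitness
PAYOFF = {("D", "D"): 2, ("D", "C"): 4, ("C", "D"): 1, ("C", "C"): 3}


def always_defect(opponent_history):
    return "D"


def tit_for_tat(opponent_history):
    # Cooperate on the first move, then repeat the opponent's previous move.
    return opponent_history[-1] if opponent_history else "C"


def play(strategy_a, strategy_b, rounds=10):
    """Return the total pay-offs of both players over repeated rounds."""
    moves_a, moves_b = [], []
    total_a = total_b = 0
    for _ in range(rounds):
        a = strategy_a(moves_b)
        b = strategy_b(moves_a)
        total_a += PAYOFF[(a, b)]
        total_b += PAYOFF[(b, a)]
        moves_a.append(a)
        moves_b.append(b)
    return total_a, total_b
```

Here `play(always_defect, tit_for_tat)` yields 22 for the defector and 19 for TFT, while two TFT players obtain 30 each and two defectors 20 each, matching panel (4) of Figure 2. A lone defector among TFT players therefore earns 22 where the residents earn 30 from one another.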

Mechanisms of reciprocal cooperation underlie symbiosis or mutualism in 
nature. Symbiosis may have been at work in such a crucial event of evolution- 
ary history as the emergence of the eukaryotic cell (Margulis 1981). Wilkinson 
(1988, 1990) provides empirical demonstration that the practice of food sharing 
among vampire bats is based on reciprocal altruism, rather than being kin-di- 
rected, and he argues that, in large groups, contingent reciprocity is a more 
powerful mechanism of altruism than relatedness. Goodall (1986:380-81)
finds reciprocal altruism pervasive among chimpanzees, and she considers it
"deeply rooted in our primate heritage." Anthropologists have applied such 
evolutionary analysis to the central issue of reciprocal sharing in human soci- 
eties. Sharing is approached as a risk-reduction device by Winterhalder (1986) 
and as a problem of collective action by Hawkes (1992). (See Hawkes (1993), 
however, for a critical view on the role of reciprocity.) 

Trivers (1971, 1985:388-89) has argued that the benefits of reciprocal
altruism have been so important in the evolution of our species that the human 
emotional system has evolved traits directly related to the contingencies of 
reciprocal cooperation and TFT. These include propensities to friendship, to 
gratitude and sympathy, to moralistic aggression against noncooperative be- 
havior, to guilt and reparative altruism, to a sense of justice, and even a capacity 
for detecting deceit and for self-deception (Trivers 1991). In this view, the 
human psyche has evolved capacities to react in appropriate ways to various 
situations related to cooperation. Lopreato (1984:151-61) views what he calls 
reciprocation as a behavioral predisposition evolved by natural selection. Econ- 
omist Hirshleifer (1987a, 1993) discusses ways in which emotions (such as 
benevolence or malevolence) can be beneficial against protagonists behaving 
as rational maximizers, providing theoretical mechanisms for the evolution of 
such tendencies (see also Margolis 1982, Frank 1988). Mealey (1993) finds 
experimental support for the existence of selective memory for faces of cheat- 
ers, a trait that would be useful in the context of reciprocity. Alexander (1987) 
incorporates similar ideas in his theory of moral systems; see also Ayala (1987) 
and Irons (1991, 1994). 

Ideas such as these have been pursued in an intriguing research program by 
Cosmides & Tooby (1987, 1989, 1992), who find empirical evidence for the 
existence in the human mind of a specialized cognitive device to detect vio- 
lations of social contracts. Their experimental program is based on the Wason 
task, in which a subject is given a conditional hypothesis of the form If P then 
Q, such as "If a person goes to Boston (P), then he takes the subway (Q)." 
The subject is presented with four cards representing possible situations that 
have the P-status of the situation on one side and the Q-status on the other. 
The cards are face down with the back showing the values P, not-P, Q, and 
not-Q, and the subject is asked to tell which cards must be turned to "test" the 
conditional hypothesis, i.e. find violations of the hypothesis. When the task is 
framed in a descriptive context such as the "going to Boston" example, fewer 
than half the respondents actually give the logically correct answer (turn the 
card with P: "A person goes to Boston," and the one with not-Q: "The person
does not take the subway"). However, when the task is framed in terms of a
rule expressing a social contract, such as "If a person is drinking beer (P), then
he must be over 20 years old (Q)," about 75% of the subjects are able to see
that the person drinking beer (P) and the person underage (not-Q) must be
checked. After a comprehensive series of experiments designed to eliminate
alternative hypotheses (that the better performance on the cheater-detection 
task is a byproduct of task familiarity, a general permission schema, or other 
content-independent general-purpose mental processes), Cosmides & Tooby 
(1992) conclude that the human mind includes a specialized content-dependent 
cognitive procedure for cheater-detection that is an adaptation for reasoning 
about social exchange. The work of Cosmides & Tooby, who label themselves 
“evolutionary psychologists,” is remarkable not only because of its promise 
in understanding human nature, but also because it provides a paradigm of 
research in which sociobiological issues are made amenable to experimental 
investigation. Implications of this work are discussed further below. 
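
The logically correct answer to the Wason task can be derived mechanically: a card must be turned only if its hidden side could reveal a case of P together with not-Q. The sketch below enumerates the possibilities; it illustrates the logic of the task only and is not part of the Cosmides & Tooby experimental procedure. The function name and the card labels are hypothetical.

```python
def cards_to_turn(visible_faces=("P", "not-P", "Q", "not-Q")):
    """Return the cards whose hidden side could violate 'if P then Q'."""
    must_turn = []
    for face in visible_faces:
        if face in ("P", "not-P"):
            p = face == "P"
            # The hidden side carries the Q-status, which may be either value.
            possible = [(p, q) for q in (True, False)]
        else:
            q = face == "Q"
            # The hidden side carries the P-status, which may be either value.
            possible = [(p, q) for p in (True, False)]
        # A violation of the conditional is P true together with Q false.
        if any(p and not q for p, q in possible):
            must_turn.append(face)
    return must_turn
```

The enumeration selects the P card and the not-Q card, the same pair in the descriptive "going to Boston" framing and in the social contract framing, so the difference in performance between the two cannot be attributed to the logical structure of the task.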

The notion of ESS, while useful to analyze reciprocal altruism, has a much 
broader field of application. Crippen & Lopreato (1989:69-72) find an early
version of the notion in the work of Pareto (1963 [1916]). It has been used to 
analyze the evolution of other forms of interactions among individuals such 
as aggression by Maynard Smith (1989:126-36), who shows that in certain
models of aggression, such as the game of "chicken," the ESS is not a pure 
strategy (like TFT) but a mixture of strategies (see also Hawkes 1992). In a 
mixed strategy, either individuals exhibit different behaviors in different cir- 
cumstances, or the population is composed of stable proportions of individuals 
exhibiting different behaviors in all circumstances. The concept of an ESS 
therefore allows for the possibility of evolutionarily maintained diversity. 
Cohen & Machalek (1988:472-77) are able to integrate various theoretical
strands in the literature on expropriative crime using the concept of mixed 
strategy to explain the prevalence of crime in different circumstances. Their 
approach, referring explicitly to the "evolution of behavioral diversity," makes 
implicit use of the ESS concept. 
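
How a mixture of strategies can be evolutionarily stable may be illustrated with the hawk-dove game, which has the same structure as "chicken." The parameterization below (a contested resource of value V and an injury cost C greater than V) is the standard textbook form and is an assumption here, not a model taken from the sources cited above; the function name and numbers are hypothetical.

```python
def hawk_dove_mixed_ess(value, cost):
    """Return (proportion of hawks, hawk pay-off, dove pay-off) at the mixed ESS.

    Assumed pay-offs: hawk vs hawk (value - cost) / 2, hawk vs dove value,
    dove vs hawk 0, dove vs dove value / 2.
    """
    if cost <= value:
        raise ValueError("a mixed ESS requires cost > value")
    p_hawk = value / cost
    hawk_payoff = p_hawk * (value - cost) / 2 + (1 - p_hawk) * value
    dove_payoff = (1 - p_hawk) * value / 2
    return p_hawk, hawk_payoff, dove_payoff


p_hawk, hawk_payoff, dove_payoff = hawk_dove_mixed_ess(value=2.0, cost=4.0)
```

At the equilibrium proportion the two behaviors earn the same pay-off, so neither can increase at the expense of the other; this is the sense in which behavioral diversity is maintained by individual-level selection.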

A general aspect of the ESS notion is that it provides mechanisms of 
evolution of behaviors that are apparently "functional" at the level of a group 
without the necessity of assuming group selection. Only individual-level se- 
lection is involved in the evolution of an ESS. The notion permits the gener- 
ation of collective outcomes from the assumption of fitness-optimizing be- 
havior (or "rationality") at the level of the individual (or the gene). Such 
features of the ESS concept should prove appealing to sociologists in the search 
for nonfunctionalist explanations of collective phenomena, and for links be- 
tween micro and macro aspects of social behavior. 


STRATEGIES OF REPRODUCTION 


A distinction going back to Fisher (1958 [1930]) views lifetimes as composed 
of two main types of effort, somatic and reproductive (Cronk 1991, Borgerhoff 
Mulder 1992). Somatic effort is the organism's investment in its own growth
and maintenance. Reproductive effort is the organism's investment in reproducing
its genetic material, which can be done indirectly by helping relatives
that share genes by common descent (indirect reproduction), or directly by 
producing offspring. Direct reproductive effort can be subdivided further into 
mating effort and parental effort. This section is about direct reproductive 
effort. It starts with a brief discussion of the origin of sex. 


Origins of Sex & Parental Investment 


The evolution of sexual reproduction, in which offspring are produced from 
the recombination of genes from two parents, is one of the central puzzles of 
biology. Sex is paradoxical because it is generally more advantageous for the 
individual and for her genes to reproduce asexually than to dilute her genetic
material with someone else's. Current theories of the origins of sex emphasize 
the advantages of sexual reproduction in situations of high biotic interactions 
with other species, as it accelerates the rate of evolution of defenses against 
predators or parasites that are also evolving more efficient means of attack. 
Asexual reproduction is favored in situations where environments must be 
conquered quickly and there is little crowding and competition (Williams 1975, 
Maynard Smith 1978, 1989:237-46, Trivers 1985:315-30). Given that sexual
reproduction has evolved, the evolution of two basic reproductive strategies, 
a female strategy (few large, relatively expensive, and immobile gametes) and 
a male strategy (many small, relatively inexpensive, and mobile gametes) has 
been explained as an instance of disruptive selection, in which both extreme 
strategies have greater fitness than intermediate strategies (Parker et al 1972). 

The reasoning underlying the explanation of the emergence of the two sexual 
strategies has been generalized by Trivers (1972) to explain the evolution of 
the great diversity of mating systems, life histories, and strategies of reproduc- 
tion and parenting in sexually reproducing organisms. The resulting approach 
is the theory of parental investment (or parental effort). Parental investment is 
defined by Trivers (1972:139) as "any investment by the parent in an individual 
offspring that increases the offspring's chance of surviving (and hence repro- 
ductive success) at the cost of the parent's ability to invest in other offspring" 
(quoted by Daly & Wilson 1983:77). The central idea is that producing off- 
spring entails costs, in terms of resources, time, energy, etc, and that natural 
selection will favor strategies that are optimal in terms of the allocation of 
these costs. In this view, natural selection shapes life histories by affecting 
choices between current vs future reproduction, or quantity vs quality of 
offspring (Borgerhoff Mulder 1992). 

A prime example of application of the theory is the explanation of the sex 
ratio, the tendency of many species to produce a ratio of males to females that 
is near one. The sex ratio theory, due to Ronald A. Fisher (1958 [1930]), was 
elaborated by Trivers & Willard (1973). The basic mechanism of the theory
is an example of the evolutionary stability of a mixed strategy. The central
fact is that in a sexually reproducing population, every offspring has one 
(biological) father and one mother. For the population as a whole, therefore, 
the expected reproductive success (RS) of a male is the total number of 
offspring divided by the number of males, and the expected RS of a female is 
calculated in an equivalent way. Therefore, in a population with more males 
than females, the expected RS is greater for a female and, if the cost of 
producing a male or female offspring is the same, it becomes evolutionarily 
more advantageous to produce females, thereby reestablishing the sex ratio 
near unity. A movement in the opposite direction takes place when there is a
surplus of females. The ESS, for each parent individually, consists in producing 
an equal number of males and females. 
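
The argument rests on simple arithmetic: because every offspring has exactly one father and one mother, the rarer sex has the higher expected reproductive success. The sketch below illustrates this with hypothetical population numbers and assumes, as in the text, that sons and daughters are equally costly to produce; the function name is an assumption.

```python
def expected_rs(total_offspring, n_males, n_females):
    """Expected reproductive success of a male and of a female, given that
    each offspring has exactly one father and one mother."""
    return total_offspring / n_males, total_offspring / n_females


# Hypothetical population producing 1000 offspring.
male_rs_surplus, female_rs_surplus = expected_rs(1000, n_males=60, n_females=40)
male_rs_even, female_rs_even = expected_rs(1000, n_males=50, n_females=50)
```

With a surplus of males the expected RS of a female exceeds that of a male, so producing daughters pays; at an even ratio the two are equal and neither sex is favored.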

The optimal sex ratio may deviate from unity as the result of a number of 
complications, such as different costs of male and female offspring. Alderson
(1993) finds results consistent with optimal sex ratio theory in his comparative 
analysis of the sex ratio of preagrarian societies as a function of differential 
male and female mortality and other factors, such as sex-differentiated patterns 
of dispersal of offspring, that can affect the cost of children of each sex.
Alderson finds that in societies with greater male mortality, the sex ratio at 
birth tends to be biased in favor of males. His analysis is consistent with 
Chagnon's (1983, Chagnon et al 1979) interpretation of the famous case of 
the Yanomamö of Brazil, where the lopsided sex ratio had been traditionally
explained as resulting from female infanticide, and interpreted within a func- 
tionalist approach that is, in fact, equivalent to group selectionism (Harris 
1974:83-107). 

Also intriguing is the prediction of sex ratio theory that, given the greater 
variance in RS of males (see below), parents in favorable ecological conditions 
should favor the production of males, to gamble on the potentially large 
reproductive pay-off of males, while parents in adverse conditions should favor 
the safer strategy of producing females. In human terms, identifying favorable 
conditions with high social status, the prediction is that high social status should 
be associated with disproportionate production of males, and lower status with 
the production of females. This prediction has been verified in a number of 
societies and times (Teitelbaum & Mantel 1971, Dickemann 1979, Hrdy 1987, 
Boone 1988, Voland 1988, Cronk 1989, Mealey & MacKey 1990). The Fisher- 
Trivers-Willard theory of the sex ratio has now been supported in so many 
ways that Trivers (1985: 271-300) considers it “confirmed” (see also Charnov 
1982). 


Male & Female Reproductive Strategies 


The theory of sexual selection proposed by Darwin (1887 [1871]) is often 
viewed as his second major contribution to evolutionary theory and provides 
the basis for understanding male and female strategies of reproduction (see 
Cronin 1991, for a nontechnical discussion). While sex ratio theory is based 
on the (tautological) equality of expected (or average) reproductive success of 
males and females, one of the central empirical findings for understanding 
differences in male and female strategies, and subsequent dimorphisms and 
behavioral differences, is the discovery that males and females typically differ 
in the variance of their reproductive success. Bateman's (1948) experiments 
with fruit flies showed two crucial patterns. First, the variance of reproductive 
success in Drosophila is greater in males than in females: while most females 
produce some offspring, males vary much more in their reproductive success, 
with some males producing many offspring, some producing none. Second, 
reproductive success in males is directly related to frequency of copulation, 
with males copulating with more females producing more offspring, while for 
females it is not. 

Species differ in the extent to which the reproductive success variances of 
males and females differ. The ratio of male to female variance in reproductive 
success is taken as a measure of effective polygyny. When the variance of 
males is greater, as in most (but not all) species, the ratio is greater than one. 
Bateman viewed the pattern by which the reproductive success of males is 
more dependent on the frequency of insemination than it is for females as 
explaining the prevalence of competition among males for access to females, 
and “an undiscriminatory eagerness in the males and a discriminatory passivity 
in females" (Bateman 1948: 365, quoted by Daly & Wilson 1983:82). The 
phenomenon can be explained in terms of Trivers’ (1972) theory of parental 
investment: the sex investing the most in the offspring (typically the female 
because she produces the largest gametes) constitutes a limiting resource for 
the other sex, and therefore the object of competition (see also Borgerhoff 
Mulder 1992:353). The concept of effective polygyny allows characterization 
of the mating systems of different species (or societies) along a continuum, 
from lower to greater polygyny. Parental investment theory is general in 
specifying that "[w]hichever is the sex with greater parental investment will 
be the sex that is courted, that competes less, and that survives better" (Daly 
& Wilson 1983:159), and the theory explains cases of “sex role reversals,” 
as in sea-horses, the giant water bug, and the phalarope bird, where males 
provide the largest proportion of care for the offspring. In these species, optimal 
strategies are switched and, for instance, "[phalarope] females do indeed com- 
pete for and court coy, reluctant males" (Daly & Wilson 1983:159). 
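
The measure of effective polygyny described above can be illustrated with a short calculation. The sketch below is only an illustration: the function name and the offspring counts are hypothetical, and the population variance is used for simplicity. Both sexes are given the same mean reproductive success, as sex ratio theory requires, so only the variances differ.

```python
from statistics import pvariance


def effective_polygyny(male_offspring, female_offspring):
    """Ratio of male to female variance in reproductive success."""
    female_var = pvariance(female_offspring)
    if female_var == 0:
        raise ValueError("female variance is zero; the ratio is undefined")
    return pvariance(male_offspring) / female_var


# Hypothetical offspring counts; each sex averages 2 offspring per individual.
males = [0, 0, 1, 2, 7]      # a few males sire most offspring, some sire none
females = [1, 2, 2, 2, 3]    # most females produce some offspring

ratio = effective_polygyny(males, females)
print(round(ratio, 2))  # a value well above 1 indicates effective polygyny
```

With these invented counts the ratio is roughly 17; a ratio near one would indicate effective monogamy, and a ratio below one the reversed pattern found in "sex role reversed" species.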


FEMALE MATING STRATEGIES The phenomena studied by Bateman are the 
basis of various strategies available (in the course of evolution, not as conscious 
choices) to both sexes to maximize reproductive success. In mammals gener- 
ally, females are “obligate” investors because of the lengthy gestation and the 
provision of milk, so that males tend to compete for mating opportunities with 
females (Borgerhoff Mulder 1992). Dawkins (1989 [1976]) reasons that females 
are typically in a position to allocate their large reproductive investment 
as carefully as possible. One way females can maximize their reproductive 
success is by judicious choice of a male. Dawkins distinguishes two main 
strategies of female choice, the “domestic bliss" and “he-man” strategies. The 
“domestic bliss" strategy consists in choosing a male that is most likely to 
remain with the female after the offspring is born and provide resources (such 
as care or food) to help raise the offspring. As Dawkins describes it (1989:149) 
“The female looks the males over, and tries to spot signs of fidelity and 
domesticity in advance. [...] One way for a female to do this is to play hard 
to get for a long time, to be coy. [...] Feminine coyness is in fact very common 
among animals...." Among variants of the domestic bliss strategy are polyg- 
ynous species where males defend territories, such as a nest site or feeding 
ground. This type of mating system is called resource defense polygyny. In 
such cases, as in certain birds, females seem to engage in a delicate calculus 
in deciding between joining a single male with an inferior territory or a male 
with a better territory but who has already acquired a mate. In birds, the 
distribution of females per male territory holder has been successfully de- 
scribed with a model of optimal female choice called the polygyny threshold 
model (Orians 1969, Verner & Willson 1966). 
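
The logic of the polygyny threshold can be sketched as a simple settlement rule. This is a minimal illustration under strong assumptions that are not part of the original model: territory quality is a single number, a female's payoff is the territory's quality divided equally among its resident females, and females settle one at a time. The function name and the quality values are hypothetical.

```python
def settle_females(territory_quality, n_females):
    """Assign females one at a time to the territory offering the best share."""
    mates = [0] * len(territory_quality)
    for _ in range(n_females):
        # Payoff a newcomer would receive on each territory if she joined it.
        payoffs = [q / (m + 1) for q, m in zip(territory_quality, mates)]
        best = max(range(len(payoffs)), key=payoffs.__getitem__)
        mates[best] += 1
    return mates


quality = [10.0, 4.0, 3.0]           # hypothetical territory qualities
mates = settle_females(quality, 4)
print(mates)  # [3, 1, 0]
```

In this example the second female joins the already mated holder of the best territory, because half of that territory (5.0) is worth more than all of the next best one (4.0). The difference in quality needed to make that choice worthwhile is the polygyny threshold.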

Betzig (1986, 1988) reviews the extensive ethnographic evidence of a cor- 
relation within human populations between male resource holding power and 
number of mates. Resource holding power has been variously measured as 
ownership of land, livestock, or capital, status, hunting success, chiefly rank, 
headmanship, personal violence (among the Yanomamó, Chagnon 1988), and 
size of the kinship network. Borgerhoff Mulder (1992:355-58) discusses some 
of the alternative explanations of these patterns. She concludes from her own 
research (1990) on the Kipsigis of Kenya that female choice, rather than 
competition among males, plays a predominant role in explaining multiple 
marriages in this population as women distribute themselves in close agreement 
with the polygyny threshold model. There is further evidence on the relationship 
between male resource holding power and mate number in ancient Rome 
(Betzig 1992b,c), nineteenth-century Sweden (Low 1991, 1993, Clarke & Low 
1992, Clarke, forthcoming), and among Mormons (with church rank a measure 
of resource holding power, Mealey 1985). Betzig & Weber (1993) find less 
sexual inequality associated with political power in contemporary American 
society, while Hooks & Green (1993) analyze competition among females for 
male allies. 

In species where males do not participate in parental care, the female trying 
to optimize her investment is reduced to choosing a male on the basis of the 
presumed quality of his genetic endowment. This is what Dawkins (1989) calls 
the "he-man" strategy. Female choice of males on the basis of exterior signals 
of "quality" can lead to a runaway process of sexual selection producing 
apparently dysfunctional traits in males, such as the colorful tail of the peacock. 
Dawkins suggests that in our own species (1989:164), "Notions of females 
withholding copulation until a male shows some evidence of long-term fidelity 
may strike a familiar chord. This might suggest that human females play the 
domestic-bliss rather than the he-man strategy." However, it has also been 
suggested (e.g. Symons 1979:127-41) that the unusual human characteristic 
of the absence of estrus in the female is an adaptation designed to give a female 
greater ability to choose the biological father of her children, by reducing the 
husband's ability to monitor or sequester her (see also Wilson & Daly 1992a). 
By not advertising ovulation, the human female has an increased opportunity 
to be fertilized by a male other than her husband (see also Hrdy 1988, and 
Buss 1989, 1992, 1994 for an evolutionary approach to sexual attractiveness 
in humans). 

While many human societies are monogamous, many more are polygynous, 
some are promiscuous, and very few are polyandrous. As Dawkins concludes 
(1989:164): 

What this astonishing variety suggests is that man's way of life is largely deter- 
mined by culture rather than by genes. However, it is still possible that human 
males in general have a tendency towards promiscuity, and females a tendency 
towards monogamy, as we would predict on evolutionary grounds. Which of these 
two tendencies wins in particular societies depends on details of cultural circum- 
stances, just as in different animal species it depends on ecological details. 


Borgerhoff Mulder (1992:358-361) reviews various factors that have been 
associated with mate number across societies. Factors associated with greater 
polygyny include the ability to monopolize power and wealth, environmental 
and political fluctuation, environmental extremes and uncertainty, and geopo- 
litical circumscription making it costly to leave the social unit; a factor of socially 
imposed monogamy in stratified modern societies is the need to reduce the 
risk of armed insurrection; polyandry is associated with a restricted resource 
base. 


MALE MATING STRATEGIES Comparing species within families of mammals, 
including primates, sexual dimorphism (average size difference between male 
and female) is correlated with effective polygyny, or the intensity of compe- 
tition for females. With a dimorphism of 1.08, humans fit in the primate series 
as “mildly polygynous” (Wilson 1975), or “slightly polygynous” (Daly & 
Wilson 1983:297). Daly & Wilson (1983:279-311) explore the ramifications 
for human social behavior of competition among males (see also Daly et al 
1982, Wilson & Daly 1992a). The nature of marriage in many societies is such 
that the reproductive capacity of women is treated as a commodity to be 
exchanged or bought. The double standard in adultery laws and customs means 
that, quasi-universally, the victim of adultery is thought to be the husband 
whose wife has been involved with another man, rather than the wife of the 
adulterer. This feature is consistent with the configuration of reproductive 
interests in a species like ours where the male participates in a substantial way 
in raising the offspring and where paternity is “mistakable.” In such cases, 
cuckoldry and the risk of devoting considerable resources to raising another 
man's child are disasters from the point of view of fitness. The same consid- 
eration explains differences in the "psychological content" of sexual jealousy 
between men and women, with a tendency of men to focus more than women 
on the sex act, rather than the deprivation of resources or affection, the greater 
propensity of men to react violently to infidelity, the complex reaction of men 
to the rape of their mate, and generally men's "proprietary" view of women's 
sexuality and reproductive capacity (Daly et al 1982, Wilson & Daly 1992a; 
see also Thornhill & Thornhill 1983). The same logic underlies the central 
role of paternity in family law (Wilson, 1987), the great attention paid to the 
resemblance of a newborn baby to the father (Daly & Wilson 1982), and the 
widespread use of coercive constraints of women's sexuality, including con- 
finement, infibulation, and other genital mutilations. 

Finally, further vestigial evidence for a history of polygynous competition is found in 
the shorter life span and greater risk proneness of males, which suggest a life 
strategy of higher risk for higher rewards, as one would expect for the sex 
with greater fitness variance (Wilson & Daly 1985), possibly manifested in a 
“show-off” strategy (Hawkes 1991). Daly & Wilson (1988a,b) and Wilson & 
Daly (1992b) have elaborated these consequences of Bateman’s principle for 
the understanding of homicide in human societies. They argue with a great 
deal of empirical evidence that a very large proportion of homicides in human 
societies can be traced to competition among men for status, or "face." These 
symbolic, intangible goals, in turn, are directly relevant to success in the 
competition for access to females. Irons (1979a), Betzig (1986), and especially 
Barkow (1977, 1984, 1989) have emphasized the link between sexual selection 
and the pursuit of culturally defined prestige criteria (see also Durham 1990: 
202). Sexual competition among males has therefore been seen as the ultimate 
motivation behind social hierarchies (Betzig 1986, Lopreato & Green 1990, 
Green 1991, Boone 1992, Ellis 1993), an interpretation consistent with the 
observation by a nonbiologically oriented researcher of the greater preoccu- 
pation of males with hierarchies (Gilligan 1982). 


Strategies of Parenting 


Borgerhoff Mulder (1992:341), after Lessels (1991), proposes that a multitude 
of trade-offs in life history decisions can be subsumed under two major dilem- 
mas: current versus future reproduction, and offspring quantity versus quality 
(see also Chisholm 1993). In relation to the second dilemma, MacArthur (1962) 
proposed a general distinction between r- and K-strategy as two ideal types of 
life history strategies found in nature. The symbols r and K refer to elements 
of the logistic population growth, in which r denotes the natural rate of increase 
of the population, and K the carrying capacity of the environment for the 
population. The typical r-strategist species produces many offspring quickly, 
riding out a high rate of offspring mortality. The r-strategist is able to colonize 
new environments quickly. The K-strategist evolves to be able to raise a few 
offspring slowly with greater investment of resources in each. The K-strategist 
maximizes efficiency in competition in crowded conditions near the maximum 
carrying capacity. Each strategy can be described by a number of characteristic 
traits listed by Pianka (1970). 
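
For reference, the logistic growth model from which the symbols r and K are taken is usually written in the following standard form, where N (a symbol not used in the text above) denotes population size at time t:

```latex
\frac{dN}{dt} = rN\left(1 - \frac{N}{K}\right)
```

When N is small relative to K, the bracketed term is close to one and the population grows almost exponentially at rate r, the regime favoring r-strategists. As N approaches K, growth falls toward zero, the crowded regime in which K-strategists are expected to prevail.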

Despite criticisms (Stearns 1977), the r and K distinction has proved endur- 
ingly useful (Daly & Wilson 1983:199-205, Dawkins 1989 [1976]). The 
typology of life history strategies has influenced the sociological literature at 
a different analytical level in the sociology of organizations, where it has been 
used to describe organizational strategies (Hannan & Freeman 1977, 1989). 
At the individual level, the r and K strategies have been roughly equated with 
an emphasis on mating effort vs parenting effort, respectively (Draper & 
Harpending 1988). In psychology, Belsky et al's (1991) "evolutionary theory 
of socialization" incorporates the r and K distinction as a main ingredient. 
Rushton (e.g. 1991:29) interprets human race differences in “diverse 
[associated] traits including brain size and intelligence, maturational delay, 
sexual restraint, quiescent temperament, and social organization” as represent- 
ing ordered positions of African, Caucasian, and Asian racial groups on an r to 
K gradient, in a research program that is controversial and rather atypical of 
contemporary sociobiological research, which tends to emphasize commonal- 
ities of human nature (Degler 1991:219). 

Among animals, mammals tend to be K-strategists, and among mammals 
humans are among the most so, together with elephants and higher primates 
(Clutton-Brock 1991). Humans typically produce few offspring, investing a 
lot in each. Human reproductive behavior, however, is a function of environ- 
mental and other factors, and issues of life history strategies lead into the 
demographic study of human fertility, including the interrelationships of phys- 
iological (such as lactational anovulation) and environmental factors determin- 
ing optimal birth spacing. Blurton Jones & Sibly's (1978) model of birth 
spacing among the !Kung foragers, for example, is based on the trade-off 
between more offspring with lower reproductive value vs fewer offspring with 
higher reproductive value. Here again the basic theoretical mechanism is the 
optimization of parental investment, given ecological and evolutionary con- 
straints. 

Consideration of birth spacing brings us to the other major life history 
dilemma, concerned with the timing of reproduction and the trade-off between 
current and future reproduction. Life history theory can be used to explain the 
timing (or existence) of three key events in the life history of human females 
(Cronk 1991:41-2): menarche, menopause, and death. The age at menarche is 
interpreted as resulting from a trade-off between the fitness advantage of 
beginning reproduction as early as possible, and the lower survival chances of 
babies born to an immature mother, complicated by the potential inclusive 
fitness benefits of delaying one's own reproduction to help one's mother raise 
siblings. If indirect benefits of delay exist, one would expect elder daughters 
to mature later than late-birth-order daughters. Menopause may have evolved 
in the human female because older females may increase their fitness better 
by helping their daughters to reproduce than by producing another child them- 
selves. In males, investment in reproduction is much less expensive, so meno- 
pause did not evolve and the loss of sexual function with age is gradual 
(Pavelka & Fedigan 1991). Consistent with this interpretation, Hawkes et al 
(1989) show how "hard-working grandmothers" among the Hadza foragers 
spend much time in food gathering, allowing their daughters to increase their 
own parental effort (see also Turke 1988). Finally, the timing of senescence 
and death itself may be the result of natural selection, either as the result of 
waning helping opportunities in postreproductive mothers (Voland & Engel 
1989), or as a result of the more general theory of senescence as manifestation 
of accumulated late-acting deleterious genes (Medawar 1952, Hamilton 1966, 
Dawkins 1989:40-2, 274). 

The basic theory of the provision of parental care by the two sexes has been 
formulated by Maynard Smith (1977, see also Irons, 1979b; Clutton-Brock 
1991). Borgerhoff Mulder (1992:362-72) approaches the issue of parental care 
in general, and particularly in the human context the variability in paternal 
care, in terms of the costs, benefits, and opportunity costs of parental care to 
either sex. For a male (1992:362), “When an additional unit of paternal care yields 
a lower fitness return than would an equal investment in obtaining copulations, 
males would be expected to seek matings rather than continue to invest in their 
offspring; conversely, expenditure should be in parental care where the pay-off 
exceeds that of an identical expenditure in mating." The male dilemma is 
between the “dad” or “cad” strategies of Harpending & Draper (1986). An- 
thropologists have investigated a number of factors that may affect the provi- 
sion of paternal care (and associated consequences, such as the stability of 
couples and general level of promiscuity), including how essential the father 
role is to survival of the child (more care, Harpending & Draper 1986), 
substitutability of female and male roles (less care), paternity certainty (more 
care, Gaulin & Schlegel 1980, Flinn 1981), male absence (less care, as in 
colonial era Iroquois), a female-biased sex ratio (less care, more promiscuity, 
Guttentag & Secord 1983, Hurtado & Hill, 1992), low success by males in 
pursuit of copulations (more care, Hawkes 1990). Finally, the variable inter- 
actions between levels of care provided by both parents are studied by Chase 
(1980). 

Infanticide and child abuse are topics that are (maybe unexpectedly) related 
to these theoretical issues. Biologists have studied the practice of infanticide 
among primates and other species (Hrdy 1977, 1979, Hausfater & Hrdy 1984, 
Daly & Wilson 1988b:37-93). Infanticide is routinely practiced among langur 
monkeys and among lions by a new male taking over a group of females. The 
male thereby prevents the females from devoting resources to another male's 
offspring and accelerates the return of estrus in the females. Such infanticide 
by, or at the instigation of, the new husband of a woman is apparently common 
in some human societies, such as the Yanomamó (Daly & Wilson, 1988b:47). 
The high incidence of this type of infanticide among animals and in some 
human societies is highly suggestive of a possible relationship in modern 
industrial societies between the presence in the household of an adult not 
biologically related to the child, such as a stepparent, and the risk of death or 
abuse. Daly & Wilson (1988a, 1991) find such a relationship. While other 
researchers disagree with the severity of the risk increase, the role of step- 
parenting in the etiology of child abuse has become an active topic of research 
(Gelles & Lancaster 1987, Gelles 1991). Concepts of reproductive effort and 
strategies and of fitness optimization are being integrated in recent treatments 
of these issues (Burgess & Draper 1989, Burgess 1991). 

Trivers (1974) has approached parent-child relations in a different light, by 
offering a theory of the conflict of interests between parents and child over 
the allocation of resources. He shows that there is a point in the development 
of the child when the fitness interests of the parents would lead them to allocate 
fewer resources to the child than its own fitness interests would dictate. In 
such a situation, one would expect conflict in the family about how parental 
resources are allocated. It would seem that such models might be useful in the 
fields of the psychology of development, or even psychological therapy. Daly 
& Wilson (1990) provide a discussion of these issues. 
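
The zone of conflict can be made concrete with a simplified cost-benefit rule. The sketch below is an illustration in the spirit of Trivers' argument, not his own formulation. It assumes that the benefit of continued investment to the current child and its cost to a future sibling can each be expressed as a single fitness value, and that the siblings are full siblings (relatedness 0.5). The function name and parameter names are hypothetical.

```python
def weaning_conflict(benefit, cost, relatedness_to_sibling=0.5):
    """Classify a benefit/cost pair under a simplified parent-offspring rule.

    The parent, equally related to all of its offspring, gains from further
    investment only while benefit > cost. The child devalues the cost borne
    by a sibling by its relatedness to that sibling, so it gains while
    benefit > relatedness_to_sibling * cost.
    """
    parent_gains = benefit > cost
    child_gains = benefit > relatedness_to_sibling * cost
    if parent_gains:
        return "agreement: continue investing"
    if child_gains:
        return "conflict: child wants more than parent is selected to give"
    return "agreement: stop investing"


# Hypothetical values: as the child matures, the benefit of further care falls.
for benefit in (3.0, 1.5, 0.8):
    print(benefit, weaning_conflict(benefit, cost=2.0))
```

Under these assumptions, conflict is expected whenever the benefit lies between half the cost and the full cost; outside that interval the interests of parent and child coincide.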

Sociobiological theories of the life history and optimal allocation of repro- 
ductive efforts are now informing an emerging synthesis in works on parenting 
or the family, which is well represented by van den Berghe (1979), Lancaster 
et al (1987), Lancaster (1989a, 1989b), Lancaster & Kaplan (1992), Troost & 
Filsinger (1993), and Rossi (1984, 1994). 


HUMAN DEVELOPMENT, SEXUAL DIFFERENTIATION, 
AND OTHER PROXIMATE MECHANISMS 


Three areas of research not generally considered part of sociobiology proper 
are nevertheless associated closely because they have the potential to provide 
the proximate mechanisms of evolutionary explanations of behavior; thus they 
deserve a brief mention (see Barchas 1976, for an early review). These are the 
study of sexual differentiation during human development, the assessment of 
differences in mental functioning between men and women, and the study of 
the relationships between hormone levels and behavior. While sociobiologists 
have investigated evolutionary explanations of strategic behavioral differences 
between the sexes, knowledge of the proximate causes of many such differ- 
ences, in humans as well as in animal models, has made considerable progress 
in recent years (see Daly & Wilson 1983:243-77). Beginning with the classic 
experiment of Jost (1953), researchers have used animal experiments and 
human clinical evidence to document a basic scenario of development in 
mammals in which the embryo develops masculine morphological and behav- 
ioral characteristics if androgens (male hormones) are present at a specific 
stage early in development, and exhibits feminine development otherwise. 
Hormones therefore play an early "organizing" role in sexual differentiation 
of the body and brain. Hormones later play a distinct "activating" role in the 
production of sex-typical behavior at puberty and later in the life course (Ellis 
1986, Udry & Billy 1987, Udry 1988). The basic physiological scenario is 
well established, and research is currently under way examining the roles of 
prenatal androgen exposure, adult hormone levels, and environmental factors 
in the determination of gender roles, in an extraordinary sample of adult women 
who had been serendipitously tested for steroid hormone exposure before birth 
(Udry et al 1992). 

There is a great deal of knowledge about the relationships between testos- 
terone level and behavior, much of it summarized in Kemper (1990). Testos- 
terone level has been found to affect, and to be affected by, behaviors such as 
deviance (Dabbs & Morris 1990, Booth & Osgood 1993); winning and losing 
in human competition, including such nonphysical pursuits as chess (Booth et 
al 1989, Mazur et al 1992); sex roles (Ellis 1986); marital outcomes for males 
(Booth & Dabbs 1993); and occupational achievement (Dabbs 1992). Among 
proximate mechanisms other than hormones are facial features and 
expressions, as in the relationship Mazur et al (1984) found between physical features of 
armed forces cadets and later rank achievement, and the physiological aspects 
of communication via mutual gaze (Mazur et al 1980, see also Rosa & Mazur 
1979, Freedman 1979). 

Differential psychologists have accumulated new findings concerning psy- 
chological differences between men and women. While the classic review of 
Maccoby & Jacklin (1974) found only four documented areas of difference 
(greater verbal ability of girls, greater visual-spatial ability, mathematical 
ability, and aggressiveness of boys), their method of categorizing different 
specific abilities into general abstract constructs such as “sociability” has been 
criticized as obscuring real differences in behavior (Daly & Wilson 1983:261-66). 
In a way consistent with the new strategy of evolutionary psychology to 
look for specific “modules” of behavior, recent research has investigated much 
more distinct abilities, such as the mental techniques used by both sexes to 
remember and find a route (women using landmark recognition, men using 
geometric clues), fine manual skills within personal space (women favored), 
and coordination of body movement in reference to external objects, as in 
aiming and throwing (men favored, Kimura 1992, Hampson & Kimura 1992). 
This research is currently painting a much richer and more interesting picture 
of male-female differences, parts of which readily admit of evolutionary ex- 
planations (Kimura 1987:145-6). 

While researchers in these associated fields are sometimes reluctant them- 
selves to draw the evolutionary implications of their findings, it seems likely 
that this body of knowledge will some day be naturally integrated into a 
comprehensive understanding of human nature based on an evolutionary per- 
spective. 


GENES, MEMES, AND THE COEVOLUTION OF GENES 
AND CULTURE 


In the last chapter of the 1976 edition of The Selfish Gene, Dawkins proposed 
that cultural evolution could be viewed in a way analogous to biological 
evolution. The "replicators" of cultural evolution analogous to genes are units 
of cultural transmission that live in human brains and are transmitted by 
imitation, which he called memes. Lumsden & Wilson (1981:7, 1983) base 
their comprehensive theory of gene-culture coevolution on the concept of 
culturgen, which they view as similar to Dawkins' meme. The idea that cultural 
evolution can be approached in an evolutionary framework has led to a number 
of attempts to integrate mechanisms of biological and cultural evolution. Clear- 
ly, such approaches can potentially integrate sociobiological knowledge from 
biology into sociological theory. While a full discussion of these attempts is 
much beyond the scope of this review, Durham (1990, 1992) has provided 
reviews of major recent contributions within the larger enterprise he calls 
“evolutionary culture theory." The five treatments include: the “Darwinian 
theory of culture" (Boyd & Richerson 1985, 1992, Richerson & Boyd 1978, 
1989, 1992); the "cultural transmission model" (Cavalli-Sforza & Feldman 
1973, 1981, Cavalli-Sforza et al 1982); the “coevolution model" (Durham 
1979, 1991); the "cultural and biological success model" (Irons 1979a, Barkow 
1989); and the "gene-culture transmission model" (Lumsden & Wilson 1981, 
1983, 1985). Other integrative treatments of biological and sociocultural evo- 
lution indigenous to sociology could be added to Durham's list, including 
Lenski, Lenski & Nolan (1991), Lopreato (1984:236-66), and Sanderson 
(1991). While these theoretical attempts all agree that cultures are transmitted 
by learning or imitation in "bundles" of cultural items of various sizes (anal- 
ogous to Dawkins’ memes or Lumsden & Wilson's culturgens), and that 
cultures evolve because some bundles acquire more carriers than others, the 
theories differ most in the main mechanism of differential social transmission 
of cultural bundles that they postulate (Durham 1990:198-202). For current 
research in sociology along these lines, see Allison (1992a,b,c) and Crippen 
(1994). It should be noted that the ultimate goal that Durham (1990, 1992) 
assigns to evolutionary culture theory is both grandiose and concrete, as it 
consists of the full reconstruction of the evolution by "descent with modifica- 
tion" of all cultural systems from a common ancestral culture, along the lines 
of recent successes of historical linguistics in tracing further back the common 
ancestry of contemporary languages (e.g. Greenberg, in press, Renfrew 1994). 


SOCIOBIOLOGY AND SOCIOLOGY 


The sociobiological enterprise is now supported by several active journals, 
including Ethology & Sociobiology, Human Nature, Behavioral and Brain 
Sciences, Journal of Social and Evolutionary Systems (formerly Journal of 
Social and Biological Structures), Politics and the Life Sciences, and Evolu- 
tionary Anthropology, and two interdisciplinary associations, the Human Be- 
havior and Evolution Society (HBES) and the European Sociobiological 
Society (ESS). Sociobiological ideas have already profoundly affected the 
fields of psychology and anthropology (for psychology see papers in Barkow 
et al 1992; for anthropology see Krebs & Davies 1991 and Smith & Winterhal- 
der 1992a), and are about to affect sociology in major ways. Some subfields 
have already been influenced or are obvious candidates, such as sociology of 
the family (van den Berghe, 1979, Filsinger 1988, Lancaster 1989a; Rossi 
1990, Troost & Filsinger 1993), criminology (Daly & Wilson 1988a,b, Cohen 
& Machalek 1988, Forthcoming 1994, Ellis & Hoffman 1990, Machalek 1991), 
and parts of demography concerned with fertility and the life course (e.g. Low 
& Clarke 1992, Low 1993, Rossi 1994, Lopreato & Yu 1988, Carey & Lopreato 
1993). The sociology of emotions is an intriguing case, as the field seems to 
have been strangely devoid of evolutionary thinking despite Darwin's (1965 
[1872]) early interest in the topic, and even though researchers are close to 
unravelling the physiological-chemical underpinnings of emotions (Kemper 
1987). This field seems ripe for an evolutionary takeover, possibly through 
links with psychology and psychiatry where sociobiological thinking is already 
more established (McGuire et al 1982, Badcock 1986, Crawford et al 1987, 
Buss 1989, Nesse 1990, Nesse & Lloyd 1992, Wenegrat 1990). Sociobiology 
is likely to affect sociology further in at least three central areas: gender roles 
and the nature of male-female differences, the problem of collective action, 
and the psychological foundations of human behavior. 


Sex & Gender 


While past research on differences between men and women has sometimes 
rejected any consideration of biological origins (e.g. Gilligan 1982), the accu- 
mulating knowledge concerning the evolution of various male and female 
strategies in species facing different sets of environmental contingencies prom- 
ises the emergence of a richer and more sensible understanding of sex differ- 
ences in human behavior (Rossi 1984). Such a new approach to gender “roles” 
(a term that would itself become questionable) would move away from the 
oversocialized view of human beings by combining evolutionary and compar- 
ative models of sex-differentiated strategies (e.g. Hrdy 1981, Wilson & Daly 
1992a) with the growing knowledge of sexual differentiation during develop- 
ment (Daly & Wilson 1983:243-77) and the physiological underpinnings of 
gendered behavior (Udry et al 1992), the detailed study of male-female dif- 
ferences in brain functioning (Kimura 1987, 1992, Hampson & Kimura, 1992), 
and a clarified assessment of the role of genes in individual development (Scarr 
& Weinberg 1978, Scarr 1992,, MacDonald 1988). It should be noted that the 
emerging sociobiological view is clearly compatible with a strong feminist 
orientation (Dickemann 1979, Hrdy 1981, Rossi 1984, Lancaster 1991, Smuts 
1992, Wilson & Daly 1992a, Gowaty 1992, Green 1993). 


Collective Action 


Sociobiology may have a major impact on sociological theory in relation to 
the issue of collective action. As discussed above, sociobiological theory has 
addressed the paradox of altruism by identifying two powerful mechanisms 
that can cause the evolution of behaviors that seem to benefit others to the 
detriment of the individual exhibiting them: Hamilton's (1964) notion of inclusive 
fitness; and Trivers’ (1971) model of reciprocal altruism together with May- 
nard Smith's (1989) treatment of it in the ESS context. The social sciences are 
currently bogged down in their own version of the paradox of altruism. Here 
the puzzle is called the "free rider" problem, or the "paradox of collective 
action" (Olson 1965). Collective action is a theoretical problem in a rational 
choice perspective assuming individual utility maximization, because paying 
the cost of participation in collective action to secure a collective good lowers 
individual utility, in the same way that altruistic behavior lowers the fitness 
of the individual engaging in it. Formal models of inclusive fitness and ESS 
models of reciprocity developed in sociobiology may provide useful tools for 
the solution of the collective action puzzle (Axelrod 1984, Axelrod & Dion 
1988, Hirshleifer & Martinez Coll 1988), and the concept of ESS may prove 
more generally useful in developing theories that link micro and macro aspects 
of behavior (e.g. Cohen & Machalek, 1988). 
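
The parallel between the free-rider problem and the paradox of altruism can be made concrete with a small numerical sketch. The linear public goods game below is a standard textbook formalization, not a model taken from the works cited above; the function name `payoff` and the parameter values (ten actors, unit cost, a multiplier of 3) are illustrative assumptions.

```python
def payoff(contributes: bool, others_contributing: int,
           n: int = 10, cost: float = 1.0, multiplier: float = 3.0) -> float:
    """Payoff to one actor in a linear public goods game (illustrative).

    Every contribution costs `cost`. Pooled contributions are multiplied by
    `multiplier` and shared equally among all `n` actors, whether or not
    they contributed. All parameter values are hypothetical.
    """
    total_contributors = others_contributing + (1 if contributes else 0)
    share = multiplier * cost * total_contributors / n
    return share - (cost if contributes else 0.0)


if __name__ == "__main__":
    # Whatever the others do, the actor is better off not contributing...
    for k in range(10):
        print(k, payoff(True, k), payoff(False, k))
    # ...yet universal contribution beats universal free riding.
    print(payoff(True, 9), payoff(False, 0))
```

With these assumed values, contributing always lowers the actor's own payoff (the individually rational choice is to free ride), even though everyone does better under universal contribution than under universal defection. This is the same structure as a behavior that lowers individual fitness while benefiting others.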

A genuine solution to the collective action problem may necessitate an 
overhaul of the psychological assumptions about individuals that are used to 
explain such macrolevel phenomena. In the standard behavioral model of 
economics, more recently extended to sociology as the rational choice 
perspective (Becker 1976, Coleman 1990), individuals are assumed to maximize 
their subjective utility. Collective action is paradoxical because the simplified 
(selfish) utility maximizer is viewed as engaging in the rational calculus anew 
in every situation where the opportunity for participation in collective action 
arises. In this view, the decision to cooperate or defect is reevaluated here and 
now in the mind of the maximizer, and under most conditions the rational 
individual will not cooperate (Olson 1965). What sociobiological models of 
the evolution of reciprocal altruism suggest, instead, is that the Prisoner's 
Dilemma has been played over and over in evolutionary time, resulting in the 
evolution as an ESS of an entire set of emotional propensities specifically 
related to this kind of social interaction, including a sense of guilt, of outrage 
at the defection of others, a capacity for friendship, and a sense of justice 
(Trivers 1985:388-89). These tendencies would make up the general reciprocation 
propensity of Lopreato (1984:151-61). The empirical work of Cosmides 
& Tooby (1992) documents the existence of a "look for cheaters" mental 
algorithm that may be part of such a set of evolved propensities. If such a 
claim is true, even in part, it follows that theoretical attempts to explain 
collective action need to assume actors with much richer and more complex 
motivations, maybe along the lines of the “solidary incentives” once postulated, 
without much evidence, by Fireman & Gamson (1979), to explain certain forms 
of collective action. 
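
The contrast between a one-shot calculation and a repeatedly played Prisoner's Dilemma can be sketched in a few lines. The example below is a hedged illustration only: the payoff values (5, 3, 1, 0) are the conventional ones used in the iterated Prisoner's Dilemma literature, the ten-round length is arbitrary, and the names `tit_for_tat`, `always_defect`, and `play` are hypothetical helpers, not code from any of the cited studies.

```python
from typing import Callable, Dict, List, Tuple

# Payoff to the first player for (own move, other's move); "C" = cooperate,
# "D" = defect. Conventional illustrative values, assumed here.
PAYOFF: Dict[Tuple[str, str], int] = {
    ("C", "C"): 3, ("C", "D"): 0, ("D", "C"): 5, ("D", "D"): 1,
}

# A strategy maps the opponent's past moves to the next move.
Strategy = Callable[[List[str]], str]


def tit_for_tat(opponent_history: List[str]) -> str:
    """Cooperate first, then copy the opponent's previous move."""
    return "C" if not opponent_history else opponent_history[-1]


def always_defect(opponent_history: List[str]) -> str:
    """Defect unconditionally, as a one-shot utility maximizer would."""
    return "D"


def play(a: Strategy, b: Strategy, rounds: int = 10) -> Tuple[int, int]:
    """Return the cumulative payoffs of strategies a and b over `rounds`."""
    history_a: List[str] = []
    history_b: List[str] = []
    score_a = score_b = 0
    for _ in range(rounds):
        move_a, move_b = a(history_b), b(history_a)
        score_a += PAYOFF[(move_a, move_b)]
        score_b += PAYOFF[(move_b, move_a)]
        history_a.append(move_a)
        history_b.append(move_b)
    return score_a, score_b


if __name__ == "__main__":
    print(play(tit_for_tat, tit_for_tat))
    print(play(always_defect, tit_for_tat))
    print(play(always_defect, always_defect))
```

Under these assumptions a defector entering a population of reciprocators earns less against them than they earn against one another, which is the sense in which a reciprocating strategy can resist invasion once interactions are repeated. The sketch says nothing about the emotional propensities themselves; it only illustrates the payoff logic that is argued to have favored them.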


Human Nature 


The augmentation of the individual-rational model of behavior with reciproc- 
ity-related evolved propensities in order to resolve the paradox of collective 
action is only one example of the way evolutionary thinking is likely to affect 
psychological assumptions about human behavior, i.e. conceptions of human 
nature. Historian Carl Degler (1991) sees the emergence of sociobiology as 
the current manifestation of a long-term revival of Darwinism in the under- 
standing of human nature in American social thought, an upswing that may 
have begun as early as four decades ago. Assessment of the eventual impact 
of sociobiology in understanding human nature is informed by a current debate 
within sociobiology between groups that Symons (1989) has labelled the 
“Darwinian anthropologists” (DAs) and the “Darwinian psychologists” (DPs) 
(see also Symons 1987, 1992, Turke 1990). (The two camps do not coincide 
with the academic fields, as many Darwinian psychologists, in particular, are 
anthropologists.) 

Simplifying somewhat, the Darwinian anthropologists hold that “human 
behavior is adaptive in the sense of being designed to maximize reproductive 
success and that measurement of reproductive differentials typically illuminates 
adaptation” (Symons 1989:131). Darwinian anthropologists often take 
what Grafen (1984) calls the “phenotypic gambit”: as it is often not feasible 
to demonstrate the heritable basis of a behavior, the phenotype (i.e. the trait 
or behavior to be explained) is assumed to be adaptive and to optimize the 
inclusive fitness of individuals. As inclusive fitness is hard to measure directly, 
Darwinian anthropologists often use proxy measures (or "proximate curren- 
cies") of fitness such as fertility rate or productivity of resources (Smith & 
Winterhalder 1992b:33-34). Symons (1992) presents as typical of Darwinian 
anthropologist research the study of Tibetan polyandry by Crook & Crook 
(1988), who conclude from their analysis of reproductive data on Tibetans 
who did and who did not marry polyandrously that polyandry is adaptive in 
the highly unusual environmental conditions characterizing the region. Dar- 
winian anthropologists often assume a large measure of phenotypic plasticity 
and the ability of actors to assess the relative pay-offs of various courses of 
action. These assumptions are actually very close to those of economic theory, 
or the rational choice model in sociology. For examples of research in the 
Darwinian anthropologist perspective, see Chagnon & Irons (1979) and Betzig 
et al (1988). 

The Darwinian psychologists (or evolutionary psychologists) argue, to the 
contrary, that "measuring reproductive differentials is at best an inefficient and 
ambiguous way to illuminate adaptation, and that Darwin's theory of natural 
selection sheds light on human affairs only insofar as it promotes understanding 
of the psychology that underpins these affairs" (Symons 1989:131). In the 
Darwinian psychologist view, the hominid brain/mind combination has been 
shaped by millions of years of evolution in the foraging lifestyle of the Pleistocene, 
and natural selection has endowed the human brain/mind with functions 
and propensities designed to solve distinct recurrent problems encountered in 
this "environment of evolutionary adaptedness" (EEA) (van den Berghe 1981, 
Barkow 1989, Tooby & Cosmides 1989, 1992, Symons 1992). Darwinian 
psychologists therefore tend to think of the human mind as "modular," com- 
posed of mostly special purpose and domain-specific mechanisms (sometimes 
called "Darwinian algorithms"), rather than consisting of a general purpose 
learning mechanism underlying a generalized capacity for culture acquisition. 
Three examples of such specialized modules are the mental structures for 
cheater-detection of Cosmides & Tooby (1992) discussed above; the innate 
"language acquisition device" (LAD) described by Pinker & Bloom (1992, see 
also Hurford 1991) that permits learning of the mother tongue by the young 
child; and a male propensity to a proprietary view of women's sexuality and 
reproductive capacity (Daly et al 1982, Wilson & Daly 1992a). The Darwinian 
psychologists differ from the Darwinian anthropologists in emphasizing that 
behavioral predispositions that optimized fitness in the EEA are not necessarily 
adaptive in a modern environment and may even reduce fitness, just as our 
evolved taste for sugar and fat may be maladaptive in a modern environment 
in which availability of these substances is practically unlimited. It follows 
that, for Darwinian psychologists, current reproductive differentials are an 
ambiguous clue that a behavior is adaptive, and that psychology (i.e. the 
investigation of human nature) is the crucial link between evolutionary theory 
and behavior (Cosmides & Tooby 1987, 1989, Tooby & Cosmides, 1989, 1992, 
Daly & Wilson 1988b:6-11). 

The Darwinian anthropologist approach may superficially appear more 
appealing to many sociologists: it emphasizes phenotypic plasticity, is 
largely compatible with rational choice, may be viewed as inherently less 
"biological" than the Darwinian psychologists' approach, and, through its emphasis 
on differential reproduction, is naturally amenable to quantitative analysis. 
Sociologists may also be less attracted to the Darwinian psychologist position 
because of a reluctance to focus on individuals, from a fear of psychological 
reductionism (Margo Wilson, personal communication). It seems, however, 
that the Darwinian psychologist approach has a greater affinity with the evo- 
lutionary overhaul of the conception of human nature that is currently taking 
place, according to Degler (1991:270-92). It may be that the resolution of 
central sociological issues, such as the problem of collective action discussed 
above, necessitates augmenting homo economicus beyond the bare-bones utility 
maximization model, by incorporating into a new working model of human 
nature the modular faculties and propensities that evolutionary psychology will 
continue to unravel (Lopreato 1984, 1990, 1993). As this evolution takes place, 
thinking about psychological foundations of social behavior will ironically 
have come full circle, retracing the intellectual evolution of Pareto (1963 
[1916]) who, after establishing the modern formalism of the utility maximiza- 
tion (rational) model in terms of indifference curves (Pareto 1971 [1909]), 
went on to argue for the necessity of a more complete depiction of human 
nature, to explain the large residual category of what he called “non-logical” 
actions. (In modern terms, these are actions that cannot be explained by rational 
choice assumptions.) Pareto (1963 [1916]) developed his theory of sentiments, 
residues, and derivations to provide this more realistic model of human be- 
havior (Crippen & Lopreato 1989, Crippen 1992). The similarity between 
Pareto’s “residues” and Lopreato’s (1984) “predispositions,” and the “mod- 
ules” and “Darwinian algorithms” of evolutionary psychologists is inescapable 
(Tooby & Cosmides 1992). The new influx of evolutionary thinking in soci- 
ology may finally usher in the realization of Pareto’s venerable project. 
Abstract 


The new sociology of knowledge examines how kinds of social organization 
make whole orderings of knowledge possible, rather than focussing on the 
differing social locations and interests of individuals or groups. The review 
begins with the effects on knowledge of the media through which it is pre- 
served, organized, and transmitted. We then analyze collective memory, ex- 
amining social conditions that shape how knowledge is transmitted through 
time. The review then examines how patterns of authority located in organi- 
zations shape both the content and structure of knowledge, looking at how 
authority affects the scope, generality, and authoritativeness of knowledge. We 
then review recent work on how social power, particularly that embodied in 
institutional practices, shapes knowledge. We examine how knowledge rein- 
forces social hierarchies and how the boundaries and categories of systems of 
knowledge are constituted. Looking at power, gender, and knowledge, we 
discuss new versions of the standpoint theories that characterized the traditional 
sociology of knowledge. Finally, we briefly review recent work on informal 
knowledge. 


INTRODUCTION 


The older sociology of knowledge epitomized by Mannheim asked how the 
social location of individuals and groups shapes their knowledge. Elements of 
this tradition became institutionalized in sociology and political science as 
attitude and opinion research. The sociology of knowledge proper, however, 
concerned with the social sources of knowledge and political ideologies, fell 
out of favor. Mannheim's work has continued to inspire current scholarship 
("The Problem of Generations" [1952 (1928)] as stimulus for Wuthnow [1976] 
or Schuman & Scott [1989]), but the tradition has come under criticism. Its 
image of the relationship of knowledge and social position seems reductionist 
(Geertz 1983:152-3), and it has too thin a conception both of knowledge and 
of the social positions or interests that affect knowledge. 

Recently sociologists interested in culture, religion, science, and ideology, 
along with scholars in social history, philosophy, anthropology, and the history 
of science, have begun to revitalize the field. The expansion of cultural studies 
throughout the social sciences has also greatly enriched the materials a soci- 
ologist of knowledge has to work with. While there is as yet no unified field, 
many diverse strands of theory and research have begun to crystallize around 
common themes. 

Changes in the phenomena encompassed by the term "knowledge" are 
symptomatic of changes in the field. The traditional sociology of knowledge 
focussed on formal systems of ideas, concentrating especially on such matters 
as the world-views and politics of intellectuals. (This review largely neglects 
the sociology of intellectuals, though we note the lively debates about the 
interests and social locations of contemporary intellectuals—Ehrenreich & 
Ehrenreich 1977, Gouldner 1979, Eyerman et al 1987, Szelenyi & Martin 1988, 
Brint 1994). The search for social interests that bias even supposedly neutral, 
disinterested, objective understanding of the world—what the very term 
“knowledge” connoted—was central to the agenda of the field. 

Newer work in sociology and cultural studies suggests that formal systems 
of ideas are linked to broader cultural patterns—what we might think of as 
social consciousness. We focus not only on the ideas developed by knowledge 
specialists, but also on structures of knowledge or consciousness that shape 
the thinking of laypersons. We do not, however, attempt to cover all aspects 
of culture. The sociology of culture has focussed largely on works of art and 
entertainment. In cultural studies, culture connotes symbolic systems that are 
deeply embedded, taken-for-granted, often enduring, and sometimes invisible. 
The sociology of knowledge instead directs attention to cultural elements that 
are more conscious, more explicitly linked to specific institutional arenas, and 
more historically variable. 

The new sociology of knowledge examines how kinds of social organization 
make whole orderings of knowledge possible, rather than focussing in the first 
instance on the differing social locations and interests of individuals or groups. 
It examines political and religious ideologies as well as science and everyday 
life, cultural and organizational discourses along with formal and informal 
types of knowledge. It also expands the field of study from an examination of 
the contents of knowledge to the investigation of forms and practices of 
knowing. 

This review begins with a fundamental factor that shapes the ways knowledge 
can be structured: the media through which knowledge is preserved, 
organized, and transmitted. It then turns to the analysis of collective memory, 
examining social conditions that shape how knowledge is transmitted through 
time. The review then examines how patterns of authority located in organi- 
zations shape both the content and structure of knowledge. We bring together 
work on how forms of authority affect the scope, generality, and authoritative- 
ness of knowledge. We then review recent work on how social power, partic- 
ularly that embodied in institutional practices, shapes knowledge. In the next 
section, we examine how knowledge reinforces social hierarchies and how the 
boundaries and categories that define the basic terms of systems of knowledge 
are constituted. Looking at the recent literature on power, gender, and knowl- 
edge, we discuss revitalized versions of the standpoint theories that character- 
ized the traditional sociology of knowledge, exploring how new approaches 
deepen the understanding of what a social standpoint involves. Finally, we 
turn briefly to recent work on informal knowledge, that knowledge ordinary 
people develop to deal with their everyday lives. 


MEDIA AND THE STRUCTURE OF KNOWLEDGE 


Perhaps the most dramatic example of how social factors affect the basic 
structure of knowledge is what Goody & Watt (1963) called “The Consequences 
of Literacy.” Historians, anthropologists, and psychologists have examined 
how the introduction of new media for the recording, transmission, 
and cumulation of knowledge changes knowledge itself. Walter Ong (1971, 
1977, 1982), in a set of sweeping arguments, has contrasted the organization 
of literate and oral cultures, arguing that the media in which words are trans- 
mitted have repeatedly transformed consciousness. 

Others (Olson 1977, Goody 1986, Graff 1987, Finnegan 1988) have drawn 
the contrast between orality and literacy less starkly. Akinnaso (1992) notes 
the presence of both formal and informal learning in literate and nonliterate 
societies, showing that formal learning can create intellectually disciplined, 
specialized, decontextualized knowledge even in nonliterate societies. Ewald 
(1988), arguing that writing conveys authority only when state power privi- 
leges it (Clanchy 1979), examines the fascinating case of an African kingdom 
whose rulers actively resisted writing in order to maintain the flexibility and 
ambiguity of the gift exchanges on which their power depended. 

Eisenstein (1969, 1979) has argued that print, which multiplied and thus 
preserved identical copies of texts, decisively transformed the shape of schol- 
arly knowledge: corrected texts could be assembled and replicated, freed from 
the inevitable corruptions of scribal transmission; rediscovered texts could be 
permanently rather than only temporarily recovered; authorship of texts could 
be established; texts and authors could be placed in a firm historical sequence; 
and knowledge could be redefined as cumulative progress, rather than as 
inevitable decay from a pristine past. Marc Bloch (1961[1940]) vividly de- 
scribes how feudal elites, often dependent on oral transmission of knowledge, 
jumbled chronologies and unwittingly assimilated new practices into an ap- 
parently unchanging tradition. 

Because literacy and especially print so profoundly altered both knowledge 
and knowing, the early-modern period has proven particularly fruitful for 
study. Eisenstein (1969) notes that the first effect of print was to give a new 
lease on life to medieval books, creating a flood of texts of mixed provenance. 
Mukerji (1983) traces the influence of printed objects, from pictorial prints to 
printed maps and books, on materialism in European culture. She argues that 
print stimulated both production and consumption and that ways of appropri- 
ating printed books became a metaphor for scientific exploration of the natural 
world. 

Print culture infused by a still vigorous oral tradition appears to have a 
special vitality (Bakhtin 1984, Thompson 1963, Levine 1977). Ginzburg 
(1980) offers a remarkable account of how literate and oral cultures interacted 
when print made the richly fabulous world of medieval books more widely 
available and reading began to give ordinary people confidence in their own 
ideas. People who read very few books read them in a radically different way 
than modern readers do, elaborating particular passages out of context and 
filtering what they read through the screen of oral culture. 

Historically specific modes of reading have been explored by numerous 
scholars. Darnton (1984:215-56) describes the intimate, passionate reading of 
prerevolutionary French readers. Grafton (1991, 1992) has analyzed the read- 
ing practices of scholars in earlier eras, while other historians (Chartier 1987, 
1989, Vincent 1989, McKitterick 1990) have explored print and reading in 
both popular and elite contexts. 

Marshall McLuhan (1962, 1964) speculated that television would again alter 
human consciousness as print had done, creating instantaneous, immediate, 
globally shared communication. But the effects of new media on knowledge 
remain unclear despite decades of mass communications research, perhaps 
because formal knowledge remains bound by print and reading even while 
popular knowledge is increasingly visual, multi-channeled, and interactive. We 
do not yet know whether and how the computer revolution will alter formal 
knowledge, perhaps eventually supplanting the book and undoing the "fixity" 
Eisenstein attributed to print. 


COLLECTIVE MEMORY 


Studies of collective (Halbwachs 1980 [1950]) or social (Fentress & Wickham 
1992) memory ask how social groups retain, alter, or reappropriate social 
knowledge. This work has developed in two important directions. First, researchers 
have shown that much presumed tradition is in fact "invented" to 
serve current social purposes (Hobsbawm & Ranger 1983, Wilford 1990), 
especially defining nations and the character of national communities (Ander- 
son 1983, Schwartz 1987, Hobsbawm 1990, Kammen 1991). Students of the 
French revolution have shown how new traditions and rituals helped create 
new political realities (Hunt 1984) and, less charitably, how memories such 
as that of the storming of the Bastille (Schama 1989) were concocted by 
political entrepreneurs. On the other hand, Schwartz (1991:234) has empha- 
sized how, in shaping the present, memories of the past provide “a stable image 
upon which new elements are superimposed.” 

The second important line of research on collective memory involves sys- 
tematic analysis of the factors that lead events or objects to be retained or lost 
as part of the stream of collective memory. Schudson (1992) has carefully 
analyzed how Watergate entered and influenced collective memory. Using the 
image of culture as a repertoire or repository (Swidler 1986), he asks why 
some things are retained and others forgotten. He notes that events are more 
likely to be remembered if they happened during one’s lifetime, if they are 
commemorated, if they touched people personally, and if they concern the 
public center of national life, as Watergate did. He also notes that events may 
be remembered indirectly (as when new scandals are dubbed “___gate”) or 
institutionally (when new rules or procedures, like that of the “special prosecutor,” 
are created). Schudson (1989:175) has systematized arguments about 
the power of culture, arguing that “a cultural object is more powerful the more 
it is within reach, the more it is rhetorically effective, the more it resonates 
with existing opinions and structures ... the more thoroughly it is retained in 
institutions, and the more highly resolved it is toward action.” 

Interests in the uses and practical determinants of cultural memory come 
together in the study of literary or artistic canons—what is preserved as part 
of a cultural heritage. Escarpit’s (1971) brilliant work made clear that “exter- 
nal” factors as well as the qualities of aesthetic works themselves affect what 
will be retained in the culture. He demonstrated that political upheavals strong- 
ly influenced whether works entered the French literary canon, and that 
whether a book was likely to be preserved as a “great work” depended on 
whether its author was part of a cohort young enough to keep the work alive 
until a new generation could rediscover it. Lang & Lang (1990), in an analysis 
of the reputations of English etchers, point out that survivors who preserve, 
catalog, and promote an artist’s work greatly increase its chances for renown. 

Barbara Herrnstein Smith (1983) argues somewhat polemically that what 
enters the literary canon depends on the interests of those who control it. 
Tompkins (1985) has shown how Hawthorne’s work was systematically pro- 
moted by those whose status claims he embodied, and Tuchman & Fortin 
(1984) have examined the effects of gender on literary reputation. Haskell 
(1976) for art and Griswold (1986) for theater have examined social conditions 
that lead to revivals or rediscoveries of objects from the artistic corpus. Gris- 
wold's systematic exploration of why specific genres of English renaissance 
plays resonated with the social dilemmas of later centuries is echoed in 
Schwartz et al's (1986) analysis of the revival of Masada in Jewish collective 
memory. 

Hareven (1979) has suggested that basic demographic structure—whether 
children have living parents, grandparents, or great-grandparents to pass on 
firsthand accounts of the past—will affect the possibilities of historical mem- 
ory. Howard Schuman and collaborators (Schuman & Scott 1989, Schuman 
& Rieger 1992a,b) have used open-ended survey questions about important 
events and changes during the past 50 years to explore relationships between 
generation and the events that stand out in memory (they find that adolescence 
and early adulthood are a formative period for historical memory, as Mannheim 
[1952 (1928)] hypothesized) and between such memories and other attitudes. 

While the factors shaping collective memory have been most clearly spec- 
ified for art worlds, others have examined how organizations preserve and 
retrieve memory (Powell 1986) and how whole societies remember (Connerton 
1989). These studies raise the tantalizing possibility that similar research could 
be done on knowledge more narrowly conceived, asking how structural or 
institutional factors influence which work by philosophers, economists, or 
physicists will be disseminated, preserved, or made canonical. 


AUTHORITY AND ORGANIZATION 


New models of how social organization influences ideas are at the heart of the 
new sociology of knowledge. A major pioneer has been David Zaret (1985, 
1989, 1991, 1992). The Heavenly Contract (1985) argued that theological 
change in English Puritanism derived from organizational "pressures" and 
intellectual “precedents.” Challenges to the authority of Puritan clerics led 
them to develop “covenant theology” which refocussed ministers’ authority 
on helping laypersons monitor their inner lives rather than on seeking radical 
reforms which would put Puritan clerics in open conflict with the Anglican 
church. Covenant theology made salvation a predictable outcome of a covenant 
between God and man. It drew on intellectual precedents, such as widespread 
knowledge of the mutual obligations commercial contracts entailed, to make 
its ideas plausible to lay listeners. 

Zaret (1989, 1991, 1992) extends these ideas, arguing that liberal-democratic 
ideology emerged in seventeenth-century England as “a collective response to 
the problem of contested authority” (1989:165). Sectarian conflict and reli- 
giously inspired radicalism following the triumph of Puritanism in the English 
Civil War provided the “episodic context” that bore upon “ideological producers.” 
Liberal ideology substituted religious toleration for efforts to build a 
“Godly Commonwealth” and natural religion for sectarian doctrinal commit- 
ments. Zaret shows also that the English proponents of liberal-democratic 
theory and of natural religion were loosely linked by “networks of friendship, 
patronage, and formal organizations” and that they shared “access to the 
intellectual precedents for the new ideology” (p. 65). 

A focus on problems of authority in contexts that directly affect ideological 
producers contains the seeds of a powerful, general approach to the sociology 
of knowledge. To take authority first: It seems reasonable to believe that the 
authoritativeness of knowledge is grounded in patterns of social authority. To 
have authoritative knowledge is to have an institution, group, or person which 
can settle disputes and establish truth. Swidler (1979:118-30) observed that 
alternative schools that renounced authority had to dispense with right and 
wrong answers to intellectual questions. Arditi (1994) has traced broad trans- 
formations in theories of manners in eighteenth-century England to a shift in 
the structure of authority within which social elites operated. Walzer (1973) 
noted that sixteenth-century English Puritanism appealed to "sociologically 
competent" elites anxious about challenges to their authority. Martin (1993) 
speculates that the authoritativeness of any belief system depends ultimately 
on the authority of persons and that a group's authority structure affects its 
epistemological assumptions. 

The post-Kuhnian (1970) sociology of science, particularly comparisons of 
organizational practices across academic disciplines, suggests that the coherence 
of intellectual "paradigms" is related (whether as cause or effect) to the 
extent of hierarchy and coordination in the social organization of various fields 
(Lodahl & Gordon 1972, Hargens 1988, Levitt & Nass 1989, Konrad & Pfeffer 
1990). Indeed, Crane (1976) and Kuhn (1969) have suggested that what dis- 
tinguishes science from other cultural enterprises such as art or religion is its 
institutional autonomy, and particularly its relatively autonomous control over 
its own reward system, in contrast to the dependence of the arts and religion 
on lay audiences and powerful patrons. Wuthnow (1987, 1989) has emphasized 
that the growing authoritativeness of seventeenth- and eighteenth-century sci- 
ence depended on its acquiring a single, secure source of patronage in the 
nation state. 

While this is not the place to review the broad spectrum of work in the 
sociology of science, new research on scientific practice and scientific work 
organization (Knorr-Cetina 1981, Latour & Woolgar 1979, Latour 1987) be- 
gins to link substantive features of scientific knowledge to scientific work 
organization. Gerson (1993), for example, has analyzed the organizational 
dynamics involved in the “segmentation,” “intersection,” and “legitimation” 
of lines of scientific work. Gumport (1990, 1991), examining the case of 
feminist scholarship, explores how new academic fields are created and legitimated. 
Star (1989) analyzes the basis for the simultaneous "plasticity" and 
"coherence" of scientific theories. "Successful scientific theories reflect com- 
mitments to work practices that are not easily changed. This does not occur 
as the result of some self-propelling quality of ideas, but rather as the conse- 
quence of commitments to training programs, technologies, standards, and 
vocabularies [which are] difficult to disentangle or dismantle" (p. 22). 

These lines of work in the sociology of science converge with work by Zaret 
and others to suggest that knowledge derives many of its features from the 
way it organizes knowledge-producing communities. Star (1989:116) asks how 
scientific knowledge coheres without a central authority, observing that mul- 
tiple, localized practices and findings are "joined across sites and ... trans- 
formed to certainty at larger scales of organization." While Star, Latour & 
Woolgar, and others observe that the actual practices of scientific laboratories 
are highly local and that they undergo extreme simplification and reification 
before they are constituted as scientific fact, this picture of a localized, nego- 
tiated order must be balanced against the forces promoting hierarchy and order 
within scientific communities. The unequal distribution of academic rewards 
such as employment, career mobility, salaries, fellowships, and prestige means
that even though scientific communities lack unified authority, their basic 
social organization forces them to act as if some ideas are better than others, 
some problems and problem solutions more important than others, and so forth.
Thus the manufacture of scientific certainty may well be a product of such 
central activities as departments deciding whom to hire, fellowship committees 
assessing research proposals, and young scientists seeking grounds for select- 
ing problems. A unifying hierarchy among ideas is built into the structures 
that allocate academic rewards, even while local variations in the routines that 
organize scientific work make dissensus both possible and invisible. 

The theoretical focus on authority relations is just one example of a broader 
movement in the sociology of knowledge toward attention to the specific 
organizational contexts in which knowledge producers work. Robert Darnton's 
work on The Business of Enlightenment (1979) and on the consequences of 
royal censorship in Old Regime France (1982) demonstrates how the contexts 
in which culture is produced and distributed affect its intellectual content. 
Important work in the sociology of religion (Butler 1990, Finke & Stark 1992, 
Warner 1993) provides evidence that, at least for America, religious partici- 
pation may be better explained by "supply"—that is, what religious providers 
offer—than by "demand"—that is, independent changes in religious needs or 

The "production of culture" perspective (Peterson 1976), developed in the 
study of the arts, is now bearing fruit in the general sociology of knowledge. 
Wuthnow's (1989) ambitious work suggests that broad economic changes and
changes in class structure influence ideas through the institutions that organize
culture production. For example, political bodies independent of traditional
landowning classes incubated and defended Reformation reforms (pp. 81-82);
state patronage created the public sphere which was the seedbed of Enlight- 
enment thought; and European socialism was more successful where liberal 
parties were too weak to compete for working class support. Critics (Calhoun 
1992, Gould 1992) have challenged some of Wuthnow's historical arguments, 
but his work is pathbreaking in two respects: first, he distinguishes several 
stages in the process by which any new ideology emerges: "processes of 
ideological production, of selection among competing ideologies, and of in- 
stitutionalization" (p. 538). Second, Communities of Discourse links the insti- 
tutional settings of knowledge production to the content and form of bodies 
of ideas. 

An intriguing question is what institutional supports make plausible the 
authoritative, universal rationality characteristic of modern thought. Both Weber
(1958 [1904-1905]) and Durkheim (1965 [1912]) explored this question.
Weber argued that Calvinism spurred rationalization in all spheres of life,
including modern science (Merton 1970 [1929], Cohen 1990). Durkheim argued
that the development of an increasingly universal world society made it
possible for ideas to take universal form, to claim a universalized truth. 

Wuthnow's (1989) analysis of Enlightenment discourse gives empirical 
substance to such a Durkheimian claim by suggesting that the “dispersed, 
overlapping, yet segmented character of social relations" (p. 320) in the eighteenth-century
public sphere contributed to distancing public roles from personal
lives for Enlightenment figures and also to emphasizing universal argument
and abstract generalization (pp. 342-343 ff). Universalism also “fit well
with the increasing levels of communication, trade, and diplomacy that were 
creating a stronger sense among educated elites of Europe as a single, or 
potentially single cultural zone" (p. 343). Zaret (1989) makes clear that en- 
shrining rationality as the objective ground of public debate was part of an 
effort to transcend divisive religious and political conflict. As Wuthnow (1989: 
343) puts it: "In trumpeting the general over the particular, writers from Locke 
to Rousseau were directing strong criticism at the parochial passions that had 
caused much of the seventeenth century to be dominated by war and were 
siding with the voice of tolerance and peace." 

Recent work on global culture also suggests reappropriating Merton's 
(1957) distinction between locals and cosmopolitans (Hannerz 1990). Scat- 
tered evidence suggests that those who must regularly deal with an imperso- 
nal, distant cultural world organized by abstract principles such as 
individualism or rationality construct knowledge differently than do those 
located socially and intellectually in more parochial settings (see Lerner 1958, 
Horton 1971, Bernstein 1975, Roof 1978, Hewitt 1989). Deemphasizing local
or regional variations, Benavot et al (1991) stress increasing global
standardization, pointing out that modern nation states, despite differing
internal needs and different histories, have tended to adopt similar educational 
systems and school curricula. 

Several scholars (Moore 1966, Meyer 1980, Thomas et al 1987, Featherstone 
1990) discuss the intellectual consequences of a global culture. Robertson 
(1992) sketches the concrete historical turning points in the formation of a 
global culture, such as the creation of global competitions (Nobel prizes, the 
Olympics), agreement on world time, and the near-universal adoption of the 
Gregorian calendar. These international intellectual agreements in effect insti- 
tutionalize and make real the universal authority of the rational to which 
Durkheim referred (and indeed Durkheim wrote during the period [1870-1920] 
in which, according to Robertson, these important elements of a global culture 
were institutionalized). If we take for granted that a minute is really a minute, 
or a meter a meter, or that we really can know what time it is in Tokyo or 
Addis Ababa, the apparent objectivity of the world we inhabit rests on an 
institutionalized global culture. 


POWER AND PRACTICES 


The contemporary sociology of knowledge has been deeply concerned with 
power (Ortner 1984, Lamont & Wuthnow 1990), especially the work of Michel 
Foucault. Despite difficulties in interpreting Foucault's work, his fundamental 
insights may be put simply: First, historical eras differ not only in what people 
think, but in what is thinkable. Foucault (1973) has written of changes in 
"epistemes"—not simply systems of classification, but the logic in terms of 
which these classifications are constructed. Different epistemes are character- 
ized by discrete rules of separation and association among things—similarity 
through resemblance in the classical age and by causal association in the age 
of reason. Second, for Foucault (1965, 1977, 1980), power is embodied in 
practices or techniques which have their own histories (Mann 1986). Foucault's
"genealogical" method traces historical transformations in techniques of power,
rather than seeking causal links between forms of power and other social
formations. 

Third, for Foucault (1977, 1978, 1980), techniques of power are also, si- 
multaneously, forms of knowledge. So, for example, the monastic practice of 
confession made real corresponding forms of knowledge, such as the varieties 
of sin or techniques for recognizing and recounting these, just as the ability 
of asylums to confine and segregate the mentally ill enacts psychiatric knowledge
of diagnosis and cure. Studies of such institutions as prisons, mental
hospitals, or clinics show how institutional practices make categories of knowledge
real. “Dividing practices” such as those of modern psychiatry define the
characteristics by which the normal may be recognized and separated from the
“deviant”; practices of “objectification,” such as the academic disciplines, turn 
aspects of human life into objects of analysis; and other practices, such as 
psychotherapy or self-labeling, create "subjects" who define and categorize 
themselves (Foucault 1983). 

New forms of knowledge also create new sites where power can be applied 
(and where resistance can form). Only, for example, when individuals are seen 
as endowed with complex, interiorized psyches can a battle ensue over whether
to liberate or repress unconscious drives.

It has now become almost commonplace to argue that new categories of 
persons are created historically (see Hacking 1986). Foucault's arguments have 
stimulated varied work on how institutional practices ground systems of knowl- 
edge. For example, the modern state's need to define and control populations 
led to new statistical techniques and new ways of categorizing persons (Hack- 
ing 1982, Rabinow 1989, Woolf 1989). Hacking (1990) has traced the con- 
catenation of intellectual and practical problems—the use of mortality data to 
calculate profits from government sale of annuities or the use of social statistics 
to characterize nations—which by the nineteenth century had transformed a 
causally determinist world into a probabilistic one. 

The academic disciplines have come under scrutiny for the ways their basic 
theories and methods reflect larger structures of power (MacKenzie 1981). 
Talal Asad (1973) has explored how the British empire's practice of indirect 
rule led British anthropology to discover autonomous traditions and stable, 
functional institutions among native peoples. (We note that the French practice 
of direct rule, with its project of making natives into Frenchmen, led French 
anthropology to be preoccupied with the structure, and particularly the capacity 
for rationality, of "primitive thought.") Along with others (Clifford & Marcus 
1986), Asad (1986) has argued that the basic intellectual and methodological 
presuppositions of anthropology are permeated by the unequal power of col- 
onizer and colonized. 

Variation in the authoritativeness and centrality of knowledge across soci- 
eties raises broad theoretical issues in the sociology of knowledge. Particularly 
fascinating is the contrast between France, on the one hand, and England and 
the United States, on the other. First, one can note that Bourdieu and Foucault, 
the contemporary theorists who have drawn the strongest links between knowl- 
edge and power, are both French. Their work seems to resonate with that of 
their French forebear, Durkheim, in stressing the overwhelming power of 
society vis à vis its members. Second, as Michèle Lamont (1987) has pointed
out in examining the career of Jacques Derrida, French intellectuals have a more
central cultural role and play to a broader public than do academic intellectuals
in the United States. In a fascinating study, Priscilla Clark (1987) points to the 
distinctiveness of French literary culture, the central role intellectuals play in
French life, and some of the infrastructure, such as government patronage, that
supports French writers (see Clark & Clark 1977). 

There are persistent differences in the intellectual directions of French versus 
Anglo-American intellectual life (see Wuthnow 1989). Enlightenment skepti- 
cism took an abstract, radical form in France while remaining staunchly em- 
piricist in England (Krieger 1970). Payer (1988) offers rich anecdotal evidence 
that contemporary French medical research and practice privilege theory over 
empirical findings, while British physicians accept only the most narrowly 
drawn empirical claims. Lamont (1992) finds that members of the French 
middle class make much sharper and more hierarchical cultural distinctions 
than equivalent Americans do. 

A focus on institutional authority can account for both the intellectual 
predilections and the relative centrality of French versus Anglo-American 
intellectual cultures. One can begin by noting the very different histories of 
the core institutions that supported knowledge production in France versus 
those in England. The Académie Française, founded in 1635, was the first of
several academies established by the French state to enhance French science 
and culture. Its eight members (shortly expanded to forty), “Les Immortels," 
were appointed for life and received substantial stipends. Their primary task 
was to formalize the rules of the French language, to maintain its "purity," 
and to develop it for the arts and the sciences. The Académie awarded literary
prizes and directed the flow of other government sinecures, thus guaranteeing 
that elite approval of literary work would provide financial rewards even in 
the face of commercial failure. The Paris Academy of Sciences, founded in 
1666, had fifteen members who gave scientific advice to the royal adminis- 
tration, becoming "France's acknowledged arbiter of scientific and technolog- 
ical activity" (Hahn 1971:21). 

The Royal Society of London, chartered by the British Crown in 1662, 
received no royal financial support and thus became a prestigious gathering 
of leisured gentlemen who conducted experiments and shared the results of 
their scientific work. Thus intellectual authority in England was based on 
shared observation and mutual exchange of ideas among a cultivated elite, 
while in France knowledge was grounded in a hierarchical system of intellec- 
tual authority. 

It is instructive to contrast the histories of the first great national dictionaries 
produced in France versus those in England. The French Academy's forty 
members immediately set about producing an authoritative dictionary of the 
French language, beginning the work in 1639 and completing it in 1694. A 
major English dictionary waited a hundred years until Samuel Johnson, be- 
tween 1746 and 1755, sponsored by a group of commercial publishers who 
sold subscriptions to finance the venture, single-handedly wrote his Dictionary 
of the English Language, aided only by a few helpers paid from his own pocket
and by friends who loaned the many books from which he extracted illustrative
quotations (Bate 1975:240-60). Thus even in its conception of language, 
French practice created an authoritative codification of "pure" French, while 
Johnson's English dictionary offered an individual rendition of the best English 
usage. 

Progress in the new sociology of knowledge has come especially from what 
we might call the “middle” level of analysis. Focussing neither on large-scale 
forces like class or the capitalist economy, nor on influences such as the 
intellectual milieu or interests of individual actors, recent theoretical and em- 
pirical work has explored how authority, power, and practices within institu- 
tions shape knowledge. 


IDENTITY, BOUNDARIES, AND DIFFERENCE 


The concern with power has taken a second form in the work of Pierre Bourdieu 
and his collaborators. Bourdieu (1988) has turned his powerful anthropologist’s 
eye on the practices that define prestige and maintain power within his own 
academic milieu as well as on the cultural strategies of Algerians (1977) and 
of the French (1984). Reminiscent of Weber's (1968 [1920]) analysis of status 
group competition (Collins 1975), Bourdieu (1984, Bourdieu & Passeron 1977) 
has examined how status groups benefit from having the knowledge they 
possess defined as valued or legitimate knowledge. 

Bourdieu's work extends the sociology of knowledge in several respects 
(despite some criticisms of his empirical claims; see Lamont & Lareau 1988,
Lieberson 1992:6-7). First, through his concept of the "habitus," Bourdieu
(1977, 1984) treats formal, academic knowledge as similar to other kinds of 
social knowledge—less knowledge of the world than knowledge of how to 
operate within it. He extends social knowledge more deeply into the person, 
examining learned habits of using and inhabiting space, the body, and time. 
By focussing on "practice" (Bourdieu 1977, Bourdieu & Wacquant 1992), he 
treats knowledge, including the valued knowledge of academia or of the 
cultural elite, as an embodied set of skills and habits that people use with more 
or less dexterity to achieve strategic advantage. But that same knowledge 
reproduces the larger system of social distinctions and social hierarchies within 
whose terms people actively maneuver. 

At the same time, Bourdieu (1969, see Ringer 1990) has argued that all 
knowledge is located within larger "intellectual fields" so that the meaning of 
knowledge depends on its relation to the field as a whole. Thus “orthodox” 
and "heterodox" positions exist in relation to a field of intellectual power 
relations. Intellectual fields are, in turn, embedded in larger "cultural fields"; 
both orthodox and heterodox positions share taken-for-granted “doxa.” 

In Distinction (1984), Bourdieu examines the internalized "taste" people
from different social milieus use to make social and cultural distinctions. Like
Meyer (1977), Bourdieu emphasizes the larger lessons conveyed by a hierar- 
chical, stratified system of education, rather than simply the relatively advan- 
taged or disadvantaged position of particular social groups within it. Class or 
status group advantage, captured by Bourdieu's (1984) term "cultural capital," 
is not simply accumulated cultural expertise. Rather, like economic capital, 
cultural capital includes the taste, confidence, and familiarity that allow the
culturally advantaged to reap a higher return on cultural investments than do 
those who begin with less. 

In a brilliant Americanization of Bourdieu's arguments, DiMaggio (1987) 
examines the source of distinctions among artistic genres. He argues that social 
groups invent and maintain bounded cultural genres in order to communicate 
status group membership in face-to-face interaction (Collins 1981). DiMaggio 
(1992) has analyzed the historical process of creating "institutions with the 
power to establish authoritatively the value of different forms of culture: in 
effect to create and to defend the boundaries among varying kinds of aesthetic 
... products and practices" (p. 21). DiMaggio (1982) has shown how nine- 
teenth-century Boston elites constructed the distinction between high and 
popular culture by founding organizations that could monopolize cultural 
objects (a high-culture repertoire), sacralize high culture (in distinct spaces 
with an awe-filled atmosphere), and legitimate the high-culture/popular-culture 
distinction. Related work by Levine (1988) traces the complex nineteenth-cen- 
tury process of bounding off high from popular culture. Beisel (1992) analyzes 
the active effort by nineteenth-century American reformers to construct a moral 
boundary between literature and obscenity. 

As the work collected in Lamont & Fournier (1992) suggests, distinctions 
are often drawn to reinforce social inequalities. Schwartz (1981) considers a 
deeper issue at the intersection of the cognitive and the social: why do vertical 
classifications universally connote social and moral inequality? Pointing out 
that the original Durkheimian attempt to root social classifications in social 
structures is circular (Durkheim & Mauss 1963 [1903]), Schwartz looks to the
universal dependence of children on larger, more powerful adults for the source 
of "vertical classification." 

Zerubavel (1991) catalogs the varied ways in which human beings make 
distinctions in everyday life. In Terra Cognita (1992) he traces the complex 
process through which European knowledge of the new world was reconfigured
as people attempted to integrate new experiences into established cognitive
structures. A big gap nonetheless remains between cognitive and social
accounts of the ways people form and maintain boundaries. Goffman's (1974, 
1983) later work moved toward a formal analysis of the ways cognitive frames 
bound and organize social interactions. But sociologists have thus far made 
less use than they might of work by social psychologists on how people make
and use social categories. Baron & Pfeffer (1993:14) note that social categories
"can be readily induced and need not rely on ... permanent or ascriptive 
distinctions. The social psychological literature is replete with studies invoking 
strong group identification and intergroup polarization, even with only the 
slightest and most transitory experimental manipulations (Kramer & Brewer 
1984)." They note further that “[o]rganizations are certainly very much in the 
business of creating categories, such as departments, ranks, and job titles" (see 
Baron 1986, Lansberg 1989). 

Boundaries of race, nationality, and religion are of special interest because 
they appear as naturalized, primordial categories even when they are clearly 
socially constituted (Barth 1969, Calhoun 1993). Fredrickson (1981) has de- 
scribed how different systems of racial categories developed in the United 
States and South Africa. Anthropologists like Abner Cohen (1969; 1974) have 
shown how groups sharpen ethnic and religious boundaries as they move into 
new economic niches. Olzak (1992) has used ecological arguments to account 
for heightened ethnic group conflict. 

Researchers have shown how larger organizational actors influence the form 
of cognitive categories and social boundaries. Sexual boundaries are accentu- 
ated when political leaders seek to tighten group boundaries (Davies 1982). 
Following Aries (1962), historians have suggested that heightened gender 
boundaries early in the modern period (Laqueur 1990) were part of a general 
European process of social segregation, drawing sharper distinctions among 
classes and ages as well as between the sexes (see Farge & Davis 1993). 
Cornell (1988) has examined how American Indians came to define themselves 
as tribes in response to the American state's insistence on negotiating only 
with tribal groups. Montejano (1987) has analyzed how racial divisions crys- 
tallized in twentieth-century Texas, reinforcing an understanding of racial 
categories as dynamic and historically contingent (Omi & Winant 1986). One 
of the most powerful ways of categorizing persons in the contemporary world 
is that of national citizenship. Brubaker (1992:1) contrasts the French under- 
standings of nationhood and citizenship (“state-centered and assimilationist") 
with those of Germany (“volk-centered and differentialist”), tracing the historical
sources of these differences. Sahlins (1989) has examined national
boundaries, looking at how the hardening of national boundaries affected the 
identity and experience of frontier populations, while Watkins (1991) has 
documented the increasing articulation between nation and demographic be- 
havior. 

One of the most ambitious attempts to think about boundaries in fresh ways 
is Abbott (1988). Abbott conceptualizes professions as actors in a system,
competing to define and establish exclusive control over jurisdictions. An 
academic knowledge system allows a profession to defend its jurisdiction, in 
part by more clearly defining its borders (p. 56). When professions attempt to
raid others' jurisdictions, they develop intellectual strategies, such as abstraction
or reduction, to subsume or displace the knowledge claims of their rivals
(pp. 98-108). Thus jurisdictional claims and knowledge frames are intimately 
linked (see Gieryn et al 1985, Halpern 1992). Gieryn (1983) has shown, for 
example, that scientists demarcate science from nonscience differently in dif- 
ferent circumstances to justify claims for authority, autonomy, and resources. 

Power, knowledge, and boundaries are brought together in new ways in two 
recent literatures, those of “postmodernism” and “feminist epistemology.” 
While the topic of postmodernism is much too large to be addressed here (see 
Arditi 1993), in essence postmodernists argue that a new “order of things” has
emerged in which the traditional categories that separated kinds of knowl- 
edge—or that separated truth from fiction, high from popular culture, and the 
sacred from the profane—no longer hold. Baudrillard’s (1983, 1988) postulate 
of a new hyperreality created not as a representation of already existing realities 
but from the power of signs themselves and Donna Haraway’s (1991) argu- 
ments for the embeddedness of cybernetics in every aspect of social reality 
today provide examples of such arguments. 

The motif of the constituted subject suggested by Foucault has been devel- 
oped along independent lines by contemporary feminists. In some ways these 
developments can be seen as a return to earlier concerns of the sociology of 
knowledge, in particular Mannheim’s (1936) efforts to find a correlation be- 
tween ideas and their location within the social structure. For feminist theorists, 
however, differences in ideas are not consequences of the different “interests” 
of social groups, but of the differential effects of power in the constitution of 
subjects. Feminists (Haraway 1991) have criticized Foucault for his failure to 
recognize differences in the ways power penetrates people belonging to dif- 
ferent social categories (gender, race, sexual preference), thereby constituting 
subjects differently, including the generation of gendered, or racialized, 
“knowledges.” For them, the constitution of “difference” has to be made a 
fundamental element of analysis, along with the always partial and situated 
nature of knowledge. 

The search for a feminist epistemology has taken several forms, which 
provide an interesting illustration of the varied ways knowledge may be so- 
cially shaped or determined. First are “standpoint theories,” like those of Marx, 
Mannheim, and other pioneers in the sociology of knowledge. Feminist schol- 
ars (Smith 1979, Hartsock 1987, Collins 1990) argue that the oppressed, or 
those excluded from power, have a unique vantage point from which to un- 
derstand aspects of the world that may be invisible to dominant groups. 

One explanation of putative differences between men’s and women’s ways 
of knowing traces these differences to childhood experience (Chodorow 1978, 
Benjamin 1988). If girls differentiate less from their mothers than boys do and 
women remain enmeshed with their mothers or their children, they are less
likely to experience themselves as separate from the things they study (subject-object
distinction, see Keller 1983), to use modes of thought that sharply
separate or disaggregate what is studied (analytic reasoning), or to organize 
knowledge hierarchically (deductive reasoning). A related position is a variant
of arguments linking knowledge to authority structures: If men are vested with 
social authority, then authoritative knowledge is "what men have to say" and
it "carr[ies] forward the interests and perspectives of men" (Smith 1987:18). 
If women are responsible for the private, relational aspects of social life and 
are excluded from public systems of authority, they are less likely to participate 
in what is currently taken to be universal, analytic, objective knowledge. These 
arguments focus less on women as knowers than on the purported maleness 
of modern science. Dorothy Smith (1987), Evelyn Fox Keller (1985), and 
others (Schiebinger 1993) have argued that the modern sciences were specif- 
ically constructed as male enterprises, accruing prestige and power from male 
styles of thought. In this way, many recent explorations of feminist epistemol- 
ogy transcend the distinction between old and new sociologies of knowledge 
by analyzing both how women's knowledge differs systematically from that 
of men (although most of this work is speculative rather than empirical) and 
how the very nature of what is taken to be knowledge is shaped by male gender 
(Flax 1983). 


INFORMAL KNOWLEDGE 


There has also been a resurgence of work on informal knowledge (Geertz 
1983), that is, the knowledge ordinary people develop to deal with their 
everyday lives (Gramsci 1971). Whether such literature is properly sociology 
of knowledge, or whether it belongs within a broadened sociology of culture 
or a sociology of consciousness remains to be seen. But at least some works 
are of interest here because they examine how ordinary people actually take 
up and use (or reject) the knowledge generated for them by elites (Gamson 
1992, Swidler 1995). Billig (1987, 1992) analyzes the intellectual structure of 
ordinary thinking and the uses people make of popular culture. Riessman 
(1990) has examined how people construct narratives of their lives. 

The sociology of formal and informal knowledge has converged, partly 
through the work of the new cultural historians who have drawn explicit links 
between folk knowledge and high culture (Greenblatt 1988, Hunt 1989, Muker- 
jee & Schudson 1991), and partly because of intellectual innovations, like 
attention to “discursive fields" which allow scholars to discover larger organ- 
izing principles within popular forms of knowledge. 

The emphasis on "practice" in cultural studies (Ortner 1984) has had a 
salutary effect in making explicit the problem of how ideas become plausible 
to those who hold them. William Sewell, Jr. (1992) has made an important
contribution by clarifying how people reproduce social structures by acting on
the cultural "schemas" embedded in the world they inhabit. The fact that 
cultural schemas are "capable of being transposed or extended means that the 
resource consequences of the enactment of cultural schemas is never entirely 
predictable. A joke told to a new audience, an investment made in a new 
market, an offer of marriage made to a new patriline, a cavalry attack made 
on a new terrain, a crop planted in a newly cleared field or in a familiar field 
in a new spring—the effect of these actions on the resources of the actors is 
never quite certain" (p. 18). Just as cultural schemas provide the bases for 
practices that reproduce structures, so Sewell has shown in earlier work (1974) 
how the plausibility of a new ideology—in this case, socialism among the 
workers of nineteenth-century Marseille—depends upon existing social and 
cultural arrangements that make the ideology seem enactable in practice (see 
Mann 1973). 


CONCLUSION 


Little of the work reviewed here explicitly locates itself in the sociology of 
knowledge. Despite diverse disciplines, perspectives, and substantive foci, 
however, these literatures allow at least some preliminary conclusions. First, 
social authority shapes the authoritativeness of knowledge, affecting both the 
authority knowledge can effectively claim and the forms that knowledge claims 
take (see Asad 1993). Second, distinctions, social and intellectual, are made 
along lines of social differentiation, particularly hierarchical ones. Third, shifts
in the media through which knowledge is transmitted, especially the transition 
to print, have dramatic effects on the entire organization of knowledge systems. 
Fourth, to explain why new knowledge emerges and to account for the social 
effects of ideas, scholars need to pay careful attention to factors that directly 
affect the institutions and actors that produce and distribute knowledge. Fifth, 
analysis of how the social location of actors affects their knowledge must 
account for the constitution of actors themselves. Sixth, knowledge and power 
are intimately related because power allows people to enact realities that make 
their knowledge plausible. 

The new sociology of knowledge, not yet a unified field, does not have a
single problematic around which debates revolve. Nonetheless, there are op- 
portunities for fruitful research along the many lines where the literatures 
brought together here converge and diverge. For example, the issue of the 
“authoritativeness” of different kinds of knowledge raises many questions, 
among them for whom and to what extent officially approved forms of knowl- 
edge really do have authority. Hoggart (1957) argued many years ago that 
English working-class culture remained predominantly oral into the twentieth 
century, and Whitehead's (1974) study of occultism suggests widespread re-
jection of the authority of modern science. But this raises the question of what
it means for knowledge to be "authoritative." Is it necessary that most lay- 
persons actually share it? Or, like medieval Catholicism, are forms of knowl- 
edge authoritative because established political authorities accept them? Or 
because they define public discourse, whatever people may think privately? 
And do the officially authoritative forms of knowledge structure the claims 
even of heretics and dissenters? 

If social authority structures knowledge, it is important to ask precisely what 
about authority influences either the form or content of knowledge. Laitin 
(1986), for example, has made a strong case that British colonial rule had 
enduring effects in Nigeria, making "ancestral village" a central political 
identity while deemphasizing the political significance of religion. But how 
precisely did British rule privilege village identity, and why did this identity 
remain salient in post-colonial Nigerian politics? More generally, is it an 
authority's control over specific incentives and sanctions, or rather control 
over central symbols that anchors systems of knowledge? Or do authorities 
influence knowledge through their control over the institutions of intellectual 
life? How do conflicts between political and intellectual power affect the 
structure of knowledge? 

Similar questions can be asked about how social inequalities structure cat- 
egorical distinctions. Most intriguing is how social categories become natural- 
ized. Is it only, as Foucault claims, because powerful institutions enact dis- 
tinctions that they come to appear incontrovertibly real? Or can classifications 
crystallize on more purely cognitive or interactionist grounds? And under what 
circumstances do distinctions remain hazy? 

These questions and others like them—about how media of intellectual 
transmission structure knowledge or how differing standpoints influence def- 
initions of what constitutes knowledge—suggest that researchers would benefit 
from greater awareness of the cumulative gains being made in the sociology 
of knowledge. Such awareness might stimulate explicit attention to the con- 
cepts and causal models that underlie particular historical or comparative 
arguments. 
Abstract


This paper reviews recent developments in the application of multilevel models 
to substantive problems in sociology. There is no single multilevel model in 
sociology, but rather a set of more or less closely related approaches for 
exploring the link between the macro and micro levels of social phenomena. 
Methodological developments of the last ten years are discussed and contrasted 
with older methods. Illustrative examples of how multilevel analysis has con- 
tributed to sociological knowledge are provided for several areas of the dis- 
cipline, including demography, education, stratification, and criminology. Cau- 
tions in the use of these models for empirical research are discussed, along 
with possible further developments. 


INTRODUCTION 


Multilevel models are used in sociology to specify the effect of social context 
on individual-level outcomes. The idea that individuals respond to their social 
context is a defining claim of the sociological discipline, which is found in 
Marx’s work on political economy (1846), in Durkheim’s studies of the impact 
of community on anomia and suicide (1897), in Weber’s research on how 
religious communities shape economic behavior (1905), in Merton’s work on 
communities, relative deprivation, and social comparison theory (1968), and 
in Berelson et al’s (1954) research into the effect of social context on voting. 
Blau (1960) and Davis (1961) published important papers devoted to contex- 
tual effects, and a thriving research agenda into these issues existed in the 
1960s and 1970s (Boyd & Iversen 1979, Blalock 1984, Iversen 1991). More 
recently, Coleman (1986) classified sociological theories into three groups
according to their multilevel content. In the first group, variation in a dependent 
variable is explained through independent variables that apply to the same 
social level (e.g. country, organization, local community, individual). In the 
second group, variation in a dependent variable at one level is explained by 
processes that operate at a higher level. In a third type, variation in outcomes 
at one level is explained by variation in outcomes at a lower level. Theories 
in the second and third groups are multilevel theories, though nearly all existing
multilevel research falls into group two. 

Blalock (1984) reviewed the theoretical and methodological literature on 
multilevel models about 10 years ago. Most of his observations—particularly 
about the problems of interpreting multilevel effects and about consistency of 
estimates—are still valid. However, much has changed in the past decade, too. 
The most useful metatheoretical formulation of micro and macro levels of 
social reality and the relationship between those levels continue to be active 
topics for debate in theoretical sociology (Alexander et al 1987, Huber 1991). 
Multilevel theories for particular social phenomena have become more refined. 
Moreover, new statistical techniques have been applied in several areas of 
sociology, with useful results. These new methods are not a panacea for the 
problems raised by Blalock in his 1984 review, as Mason (1991) notes, but 
they do represent a significant advance over earlier methods, and they have 
led to new types of substantive analyses as well. 

Because so much work is ongoing in so many different areas, we cannot 
attempt an exhaustive review of all sociological research that is arguably 
multilevel. Instead, we selectively review research in order to describe the 
recent methodological developments, to show how the newer methodological 
work is being used in substantive areas, to discuss how the new models relate 
to each other, and to identify areas where further developments are needed or 
in process. 

The term multilevel can be used in different ways. There is no single 
multilevel model; there is rather a variety of models that have been used to 
analyze social processes postulated to operate at more than one level of anal- 
ysis. One can characterize theories, methods, and data collection as either 
unilevel or multilevel; while they tend to be associated, the association is not 
perfect. Researchers sometimes theorize at multiple levels but use data at only 
one level. They sometimes theorize at one level, but use data at a different 
level. The issues raised by these efforts (aggregation, the "ecological fallacy" 
etc) are beyond the scope of this review (see Blalock 1984 for a summary). 
In this review, we use the term “multilevel model" to designate the specifica- 
tion and testing of multilevel theories with multilevel data. 

Blalock (1984) defined contextual analysis as follows: “The essential feature 
of all contextual-effects models is an allowance for macro processes that are 
presumed to have an impact on the individual actor over and above the effects
of any individual-level variables that may be operating" (p. 354). If we gen- 
eralize his use of the term individual to apply to any unit that is micro relative 
to some other macro level in the analysis, his definition is still quite serviceable. 
The notion of context is quite general and can include spatial contexts (e.g.
countries, states, communities), temporal contexts (i.e. history), organizational 
contexts (classrooms, schools, firms), and social/cultural/economic contexts 
(ethnic groups, social classes, economic sectors). Analogous models are em- 
ployed when the "context" is the method of data collection or some other 
common property for a set of data points, as in meta-analysis—the use of 
statistical models to reconcile results from different studies on the same topic— 
or the analysis of factorial surveys (Hedges & Olkin 1985, Braun 1989, Hox
et al 1991, Bryk & Raudenbush 1992). The theoretical specification of context 
can be simple, or rather involved (Blalock 1984, Iversen 1991; or at a more 
metatheoretical level, see Alexander et al 1987). Multiple contexts can apply 
to a given unit. Contexts can be overlapping or nested. They can have fuzzy 
boundaries or clear ones. Not surprisingly, more complex specifications can 
introduce formidable methodological difficulties. 

Generally speaking, multilevel models explain microlevel outcomes in two 
ways: (i) by showing that parameters of models specified at the micro level— 
where microlevel covariates are used to explain microlevel outcomes—are a 
function of context, and (ii) by showing that this micro-macro relationship can 
be expressed in terms of characteristics of the context, which take the form of 
macrolevel variables. Two types of macrolevel variables are commonly found 
in multilevel models: (a) context-specific means or higher moments of micro- 
level variables, and (b) global variables (Lazarsfeld & Menzel 1969) that are 
not expressible as functions of microlevel variables, or at least not the micro- 
level variables found in the microlevel equation. 

We have already noted implicitly that multilevel models are also called 
contextual models. Readers of the recent literature will have noted many other 
names for these models, including, but not limited to, hierarchical linear model, 
hierarchical linear regression, random coefficients model, hierarchical mixed 
linear model, or bayesian linear model (Kreft et al 1990). To a certain extent,
the proliferation of names comes from the statistical properties of the various 
modeling strategies used to analyze multilevel data. These names apply pri- 
marily to the extensions of contextual regression analysis that have been used 
extensively in sociology since the 1960s (as reviewed in Boyd & Iversen 1979, 
or Blalock 1984). 

However, several other types of multilevel models are also in use or being 
developed in sociology, including (i) extensions of contingency table analysis 
to include multilevel effects; (ii) extensions of event-history analysis to include 
multilevel effects; (iii) endogenous switching regressions; (iv) extensions of 
latent-variable models to include multilevel effects; (v) the development of
micro-macro (as opposed to macro-micro) models; and (vi) nonstatistical meth-
ods of multilevel analysis. We discuss these alternative methods later in this
review.


The familiar model for contextual effects and more recent extensions are 
the models most widely associated with the term multilevel model. These are 
regression models that are linear in their coefficients. Generally speaking, they 
are used to analyze data that consist of multiple macro units (contexts) and 
multiple micro units within each macro unit. These regression models can be 
expressed in two algebraically equivalent forms: (i) as an equation relating a 
microlevel outcome to a set of microlevel variables along with a set of equa- 
tions in which the coefficients of this microlevel model are expressed as 
functions of macrolevel variables, or (ii) as a single equation where the micro- 
level dependent variable is expressed as a function of both micro and macro 
variables. This second form generally includes interactions between the micro 
and macro variables. 
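
The equivalence of the two forms can be checked numerically. The following is a minimal Python sketch, assuming one microlevel variable x, one macrolevel variable z, and macro equations without error terms. The coefficient names (g00, g01, g10, g11) and their values are hypothetical and chosen only for illustration.

```python
# Illustrative coefficients (hypothetical values, not estimates from any study).
GAMMA = {"g00": 50.0, "g01": 2.0, "g10": 3.0, "g11": -0.5}


def two_equation_form(x, z, g=GAMMA):
    """Form (i): micro coefficients are first computed from the macro variable."""
    intercept = g["g00"] + g["g01"] * z   # macro equation for the intercept
    slope = g["g10"] + g["g11"] * z       # macro equation for the slope
    return intercept + slope * x          # micro equation


def single_equation_form(x, z, g=GAMMA):
    """Form (ii): one equation with a micro-macro interaction term."""
    return g["g00"] + g["g01"] * z + g["g10"] * x + g["g11"] * z * x


if __name__ == "__main__":
    # Both forms should print the same predicted value for each (x, z) pair.
    for x, z in [(0.0, 0.0), (1.5, 2.0), (-2.0, 4.0)]:
        print(x, z, two_equation_form(x, z), single_equation_form(x, z))
```

The interaction coefficient g11 in the single-equation form is the same quantity that describes how the micro slope depends on the macro variable in the two-equation form.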


The Basic Model 


The major advances in the past ten years concern not the functional form 
relating micro and macro variables, but rather a more sophisticated treatment 
of the error structure for these models. Whereas the older models can be 
characterized as fixed effects regression models, the new models specify the 
regression coefficients as random effects. In a fixed effects multilevel regres- 
sion model, the microlevel coefficient is expressed as an exact function of 
macrolevel variables. Random effects multilevel models, in contrast, contain 
error terms in the macro equations. The inclusion of these error terms at the 
macro level implies a more complex error structure in the single-equation 
version of the multilevel model. The use of random coefficient models allows 
the data analyst to decompose the variance in the dependent variable into the 
within-context variance and the between-context variance, and to study these
two sources of variation for the microlevel outcome. Thus, random coefficient 
multilevel models are a type of variance components model. 

The distinction between the fixed effects multilevel model and the random 
effects multilevel model can be seen in a simple example, adapted from Bryk
& Raudenbush (1992). Suppose that math achievement in elementary school 
(Y) for individual “i” is a function of socioeconomic status (SES), and suppose 
that the relationship follows the following simple equation: 


$$Y_i = \alpha_0 + \beta_1 X_i + \varepsilon_i$$


where $X_i$ is the SES of student i. As written, the intercept $\alpha_0$ gives an estimate
of the expected math achievement for a student whose socioeconomic status
score is zero. Since zero is not a meaningful value for socioeconomic status,
we can "center" X (e.g. Willms 1986, Iversen 1991, Bryk & Raudenbush 1992)
as a deviation from the mean SES in the sample.1


$$Y_i = \beta_0 + \beta_1 (X_i - \bar{X}) + \varepsilon_i$$


Here $\beta_0$ provides an estimate of the expected math achievement for a student
whose SES is at the mean, and $\beta_1$ provides an estimate of the effect of a unit
of SES on math achievement. This equation is a typical microlevel equation,
which specifies the achievement of all students with mean SES to have the
same expected math achievement $\beta_0$, and specifies the effect of SES ($\beta_1$) to be
the same for all students. In other words, the effects in this model are equal
for all students. In an extreme alternative to this fixed effects model, the analyst
might hypothesize that each student has his/her own unique $\beta_0$, which would
predict his/her math achievement if this student were to come from a mean 
SES family. Furthermore, the analyst might argue that each student has his/her 
own unique response to the academic advantages or disadvantages related to 
family SES. Under the assumption that each student has his/her own coeffi- 
cients, the model now becomes: 


$$Y_i = \beta_{0i} + \beta_{1i} (X_i - \bar{X}) + \varepsilon_i \tag{1}$$


where $\beta_{0i}$ is the intercept for student "i," and $\beta_{1i}$ is the effect of SES for student
"i." Because the parameters of Equation 1 vary by individual, this model is
called a varying parameters model. If it is assumed that these parameters are 
fixed but unknown, then the model is a fixed effects varying parameter model 
(Judge et al 1985, Greene 1990). If the students in the sample constitute the 
entire population of students, the conceptualization of these parameters as 
varying but fixed is plausible on theoretical grounds. These parameters could 
not be estimated without further constraints, since the number of parameters 
is twice as great as the population size. Usually, however, the students in the 
study would only be a sample from some larger population, and it would make 
more conceptual sense to view the individual-level coefficients in Equation 1 
as random draws from some larger population of coefficients, just as the 
students in the sample are random draws from some larger population of 
students. 

Under this assumption, the coefficients can be reexpressed to indicate that 
they are varying and random. 


$$\beta_{0i} = \beta_0 + v_i \tag{2}$$

$$\beta_{1i} = \beta_1 + \omega_i$$

Inserting these values back into the microlevel equation gives: 


1See Bryk and Raudenbush (1992) or Iversen (1991) for an extended discussion of the various
types of centering employed in multilevel work. 


336 DIPRETE & FORRISTAL 


$$Y_i = \beta_0 + \beta_1 (X_i - \bar{X}) + \{\varepsilon_i + v_i + (X_i - \bar{X})\,\omega_i\} \tag{3}$$


This model, which is sometimes called the Swamy regression model (1971), 
is similar to Equation 1 except that it has a more complex, heteroskedastic 
error structure that includes the random components from Equation 2 multi- 
plied by microlevel covariates. The statistical consequences of this change are 
fairly straightforward. Under the standard (strong) assumptions found in ele- 
mentary discussions of regression, ordinary least squares (OLS) estimates of 
$\beta_0$ and $\beta_1$ are consistent, but OLS estimates of their standard errors are incon-
sistent, and OLS is less efficient than alternative estimation procedures.
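
The heteroskedasticity can be made explicit with a short derivation. This is a sketch under added assumptions: the student-level error is independent of the two coefficient errors, and the variance symbols $\sigma^2$, $\tau_{00}$, $\tau_{11}$, and $\tau_{01}$ are notation introduced here rather than the authors'. Write $d_i = X_i - \bar{X}$, and let $v_i$ and $\omega_i$ denote the random parts of the intercept and the slope. The variance of the composite error is then:

```latex
\begin{aligned}
\operatorname{Var}(\varepsilon_i + v_i + d_i\,\omega_i)
  &= \operatorname{Var}(\varepsilon_i) + \operatorname{Var}(v_i)
     + 2 d_i \operatorname{Cov}(v_i, \omega_i) + d_i^2 \operatorname{Var}(\omega_i) \\
  &= \sigma^2 + \tau_{00} + 2 d_i\,\tau_{01} + d_i^2\,\tau_{11}.
\end{aligned}
```

Because the last two terms depend on $d_i$, the error variance differs across students unless $\tau_{01} = \tau_{11} = 0$. The usual OLS standard-error formula assumes a constant error variance, so it does not apply here.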

Model 3 is clearly more general than Model 1, but it is obviously unsatis- 
fying from a substantive point of view; it provides no explanation for how 
these coefficients vary across individuals. The random effects multilevel model 
can be viewed as a way of imposing further structure on Equation 3 and, in 
the process, providing an explanation for the varying coefficients (Kreft et al 
1990).2 It does so in the specific case where the analyst can argue that these
microlevel units are located in different and distinguishable social contexts, 
and also that the properties of these social contexts provide an explanation for 
variation in the microlevel coefficients. 

In the student example above, a natural context is the school (other possi- 
bilities are the classroom or the school district). If we use the letter “j” to index 
the school of each student, we can replace Equations 2 by a more substantively 
meaningful set of equations. 


$$\beta_{0ij} = \gamma_0 + v_j \tag{4}$$

$$\beta_{1ij} = \gamma_1 + \omega_j$$


In Equations 4, all the individual-level variation in microlevel coefficients 
is attributed to the school, and every student in a given school is presumed to 
have the same microlevel coefficients. In other words, the intercepts and slopes 
vary systematically by school, but there is no further variation within schools. 
One can go further and model the expected value for each school’s coefficients 
as functions of that school’s characteristics. For example, letting $Z_{1j}$ represent
the mean SES in school “j”, and letting $Z_{2j}$ be a dummy variable that indicates
whether school j is a Catholic school or not, we could reexpress Equation 4 
as: 


$$\beta_{0ij} = \gamma_{00} + \gamma_{01} Z_{1j} + \gamma_{02} Z_{2j} + v_j \tag{5}$$

$$\beta_{1ij} = \gamma_{10} + \gamma_{11} Z_{1j} + \gamma_{12} Z_{2j} + \omega_j$$

or in a single equation form as:


$$\begin{aligned} Y_{ij} = {} & \gamma_{00} + \gamma_{01} Z_{1j} + \gamma_{02} Z_{2j} + \gamma_{10} (X_{ij} - \bar{X}) + \gamma_{11} Z_{1j} (X_{ij} - \bar{X}) \\ & + \gamma_{12} Z_{2j} (X_{ij} - \bar{X}) + \{\varepsilon_{ij} + v_j + (X_{ij} - \bar{X})\,\omega_j\} \end{aligned} \tag{6}$$


2The seminal discussion of this approach to the linear model is found in Lindley & Smith's (1972)
discussion of bayesian approaches to regression. Bayesian approaches are closely related to—and
in some respects equivalent to—Stein-like estimators (Efron & Morris 1972, or Judge et al 1985 for
a recent extended discussion).


In Equation 6, the macrolevel variables interact with the microlevel vari- 
ables, and the error structure in brackets contains both microlevel terms and 
macrolevel terms. If the data contain multiple microlevel observations within 
the same context, Equation 6 implies that the errors will be heteroskedastic 
and correlated across microlevel units. Under the usual assumptions, OLS 
estimation of Equation 6 will produce consistent estimates of the coefficients, 
but inconsistent estimates of their standard errors. 
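
To see why the errors are both heteroskedastic and correlated within a context, one can compute the covariance matrix that the bracketed error term implies for the students of a single school. The Python sketch below assumes that the student-level error is independent of the school-level errors. The names sigma2 (student-level variance), tau00 and tau11 (variances of the intercept and slope errors), and tau01 (their covariance) are notation introduced here for illustration, and the numeric values are hypothetical.

```python
def error_covariance(deviations, sigma2, tau00, tau01, tau11):
    """Implied covariance matrix of the composite errors within one context.

    deviations: centered micro-level values (X_ij minus the mean) for the
    units sharing the same context j.
    """
    n = len(deviations)
    cov = [[0.0] * n for _ in range(n)]
    for a, da in enumerate(deviations):
        for b, db in enumerate(deviations):
            # Part shared by all units in the context: Var/Cov of v_j + d * omega_j.
            shared = tau00 + (da + db) * tau01 + da * db * tau11
            # The unit-specific error contributes only on the diagonal.
            cov[a][b] = shared + (sigma2 if a == b else 0.0)
    return cov


if __name__ == "__main__":
    matrix = error_covariance([-1.0, 0.0, 2.0], sigma2=1.0,
                              tau00=0.5, tau01=0.1, tau11=0.25)
    for row in matrix:
        print([round(value, 3) for value in row])
```

The unequal diagonal entries show the heteroskedasticity, and the nonzero off-diagonal entries show the correlation among units in the same context. Setting tau00, tau01, and tau11 to zero reduces the matrix to sigma2 times the identity, the case in which OLS standard errors are appropriate.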

If the analyst knew the variance-covariance matrix for the error structure of 
Equation 6, it would be straightforward to estimate Equation 6 with generalized 
least squares (GLS) (see e.g. Goldstein 1987). Because this matrix is generally 
not known, however, it must be estimated along with the coefficients in 
Equation 6. The best methods for doing this are iterative methods such as the 
EM algorithm, which alternately estimates the parameters of Equation 6, then 
the variance-covariance matrix of the error, and so forth until convergence is 
reached (Mason et al 1983, Goldstein 1987, Bryk & Raudenbush 1992; see 
Kreft et al 1990 for a discussion of alternative iterative methods). These 
methods have been incorporated into a collection of standalone statistical 
programs such as GENMOD, HLM, ML3, VARCL, and routines that run under 
SAS, GAUSS, and BMDP (see Kreft et al 1990 for a discussion of many of 
these routines). 
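
The alternating logic of such iterative methods can be illustrated on the simplest special case. The sketch below is a minimal Python illustration, assuming a model with only a random intercept and no covariates, y_ij = mu + u_j + e_ij, estimated by maximum likelihood with the EM algorithm. It is not the algorithm of any of the programs named above, which handle far more general models. The function name and starting values are hypothetical.

```python
def em_random_intercept(groups, n_iter=500):
    """EM for y_ij = mu + u_j + e_ij, with u_j ~ N(0, tau2) and e_ij ~ N(0, sigma2).

    groups: list of lists, one inner list of outcomes per context.
    Returns (mu, tau2, sigma2). Assumes the data are not all identical.
    """
    n_total = sum(len(g) for g in groups)
    n_groups = len(groups)
    mu = sum(sum(g) for g in groups) / n_total   # start at the grand mean
    tau2, sigma2 = 1.0, 1.0                      # arbitrary positive starting values

    for _ in range(n_iter):
        # E-step: posterior mean and variance of each context effect u_j,
        # given the current parameter values.
        post_mean, post_var = [], []
        for g in groups:
            n_j = len(g)
            weight = tau2 / (tau2 + sigma2 / n_j)
            post_mean.append(weight * (sum(g) / n_j - mu))
            post_var.append(tau2 * (1.0 - weight))

        # M-step: update the fixed part, then the two variance components.
        mu = sum(y - m for g, m in zip(groups, post_mean) for y in g) / n_total
        tau2 = sum(m * m + c for m, c in zip(post_mean, post_var)) / n_groups
        sigma2 = sum(
            sum((y - mu - m) ** 2 for y in g) + len(g) * c
            for g, m, c in zip(groups, post_mean, post_var)
        ) / n_total
    return mu, tau2, sigma2


if __name__ == "__main__":
    data = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 9.0]]
    print(em_random_intercept(data))
```

Each pass uses the current variance components to update the estimate of the fixed part, and then uses the updated fixed part to re-estimate the variance components, repeating for a fixed number of iterations. A production routine would instead stop when the change in the estimates falls below a tolerance.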

The basic difference between the random effects multilevel model shown 
in Equation 6 and the older fixed-effects multilevel model found in most of 
the literature reviewed in Boyd & Iversen (1979) or Blalock (1984) concerns 
the error structure shown in Equations 5. The fixed effects model assumes 
these errors are zero. This assumption converts Equation 6 into a standard 
regression that contains interaction terms, and the coefficients of this regression 
can be estimated using OLS without any of the statistical difficulties described 
above. As Bryk & Raudenbush (1992) note, the choice of fixed effects or 
random effects is not arbitrary, because the hypothesis of zero errors in the 
macro equations can be tested. It is always possible to convert a random-effects 
formulation into a fixed-effects formulation, in which the error variances in 
Equation 5 become zero, by including dummy variables for each context (with 
appropriate constraints to ensure identification) in Equations 4 or 5. This 
approach, however, has two difficulties: it introduces a large number of addi- 
tional coefficients, and it blurs the distinction between sample and population 
that is preserved by the random-effects model. The random effects model is
superior on both grounds. 

Equation 6 incorporates the assumption that both the intercept and slope 
coefficients are functions of context. One could restrict the model so that only 
some slopes vary while others are fixed, or one could specify that only the 
intercept coefficients vary, while all slope coefficients are fixed. One could 
also specify that some but not all of the error variances in the macro equations 
are zero. These hypotheses are all testable (Bryk & Raudenbush 1992). Finally, 
it is not necessary that all of the within-context regressions be identically 
specified; all that is required is that one or more of the microlevel coefficients 
be comparable across contexts (Wong & Mason 1989). 

The use of newer statistical techniques to study a particular multilevel 
problem can alter one's substantive conclusions.3 Furthermore, the newer
techniques can lead to interesting new substantive analyses. The importance 
of context in shaping outcomes is illuminated by an analysis of the variance 
components of the multilevel model. For example, if most of the variation in 
stratification outcomes occurs between families, or if most of the variation in 
achievement occurs between schools, one might reasonably conclude that the 
family is a major determinant of stratification outcomes, or that the school is 
a major determinant of achievement. In contrast, a finding that almost all the 
variation in stratification outcomes was within families, or that almost all the 
variation in achievement outcomes was within schools leads to different sub- 
stantive conclusions. These types of analyses are natural products of a ran- 
dom-effects multilevel analysis (Goldstein 1987, Bryk & Raudenbush 1992, 
see also Hauser & Mossel 1985). 
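
A simple descriptive version of this comparison splits the total sum of squares of an outcome into a between-context part and a within-context part. The Python sketch below is only that descriptive decomposition, not the model-based variance component estimates produced by a random-effects program. The function name and the toy data are hypothetical.

```python
def variance_shares(groups):
    """Return (between_share, within_share) of the total sum of squares.

    groups: list of lists, one inner list of outcomes per context
    (e.g. one list of achievement scores per school).
    """
    all_values = [y for g in groups for y in g]
    grand_mean = sum(all_values) / len(all_values)

    # Between-context part: context means around the grand mean, weighted by size.
    between_ss = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    # Within-context part: individual outcomes around their own context mean.
    within_ss = sum((y - sum(g) / len(g)) ** 2 for g in groups for y in g)

    total_ss = between_ss + within_ss
    if total_ss == 0:
        raise ValueError("outcome has no variation")
    return between_ss / total_ss, within_ss / total_ss


if __name__ == "__main__":
    scores_by_school = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 9.0]]
    print(variance_shares(scores_by_school))
```

A between share close to one corresponds to the case in which the context looks like a major determinant of the outcome. A share close to zero corresponds to the case in which almost all variation lies within contexts.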

Random-effects multilevel models also allow more sophisticated attention to 
the task of estimating microlevel models for specific contexts. Within-context 
models are of particular interest to policy analysts when the contexts (e.g. 
schools) can be manipulated if their performance is wanting (Willms 1992). 
Multilevel data provide three distinct estimators for within-context coefficients. 
The analyst could use within-context data to estimate a within-context regres-
sion, but the estimates will be imprecise if the within-context sample is small.
Alternatively, the analyst could use the macro model shown in Equations 4 or 5 
to obtain estimates of the within-context coefficients, either by using the 
estimated mean of Equations 4, or by plugging in the appropriate values for the 
macro variables in Equations 5. This second estimator uses data for all contexts, 
not just the context for which coefficient values are desired, because the 
regression coefficients of the macro equation are estimated with data for all 


3See, for example, Kreft & de Leeuw's 1988 demonstration of how conclusions about the effects
of a school's sex-ratio on individual reading ability changed when one used a random effects instead 
of a fixed effects multilevel model. 
contexts. As a third possibility, one could weight the alternative estimators and
average them. The random-effects multilevel model follows this third course; it 
provides an empirical-Bayes (EB) estimator for individual-level coefficients that
is an “optimally” weighted average of the within-context estimate and the 
between-context estimate (Mason et al 1983, Bryk & Raudenbush 1992). This 
weighted average “shrinks” the within-context estimate toward the between- 
context estimate, with the level of shrinkage determined by the precision of the 
within-context estimator (e.g. Rubin 1989).4 Because information is “borrowed”
from other contexts, this technique provides more stable within-context esti- 
mates, though the new estimates may have greater bias if the macrolevel model 
is misspecified (Braun 1989, McCullagh 1989, Lewis 1989). 
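
The weighting can be illustrated for a single coefficient. The Python sketch below assumes the simplest scalar case, in which the weight on the within-context estimate is the between-context variance divided by the sum of that variance and the sampling variance of the within-context estimate. The function and argument names, and the numbers in the usage example, are hypothetical.

```python
def eb_estimate(within_est, within_var, between_est, tau2):
    """Shrinkage estimate for one context-specific coefficient.

    within_est:  estimate from the context's own data
    within_var:  sampling variance of within_est (large when the sample is small)
    between_est: prediction from the macro-level model
    tau2:        variance of the true coefficients across contexts
    Returns (estimate, weight placed on the within-context estimate).
    """
    weight = tau2 / (tau2 + within_var)
    estimate = weight * within_est + (1.0 - weight) * between_est
    return estimate, weight


if __name__ == "__main__":
    # Same within- and between-context estimates, different precision.
    print(eb_estimate(10.0, 2.0, 4.0, 2.0))   # imprecise within estimate
    print(eb_estimate(10.0, 0.5, 4.0, 2.0))   # more precise within estimate
```

The less precise the within-context estimate, the smaller its weight and the further the result is pulled toward the macro-level prediction. That is the sense in which small contexts "borrow" more information from the others.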


ELABORATIONS OF THE MULTILEVEL LINEAR MODEL 


As Blalock (1984) and Iversen (1991) noted in their earlier reviews, the 
theoretical specification of context can become quite complicated. While the 
basic multilevel model discussed above incorporates a single context within 
which microlevel units are nested, multilevel effects can operate through 
multiple contexts at once, which can be either nested or overlapping. When 
multiple contexts are nested within each other (e.g. classrooms within schools 
within districts), the two-level multilevel model discussed above can be ex- 
tended by specifying each of the coefficients in the second-level equation as 
a function of the attributes of a higher level of context. Goldstein (1987), Kreft 
et al (1990) and Bryk & Raudenbush (1988; see also Bryk & Raudenbush 
1992, Lee & Bryk 1989) have made extended presentations of this model. 
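
As an illustration of this extension, a generic three-level specification for students i nested in classrooms j nested in schools k can be written as follows. The notation is an assumption introduced here for the sketch, with one covariate at each level: $X_{ijk}$ for students, $W_{jk}$ for classrooms, and $Z_k$ for schools. It is not taken from any one of the cited presentations.

```latex
\begin{aligned}
Y_{ijk} &= \pi_{0jk} + \pi_{1jk} X_{ijk} + e_{ijk}
  && \text{(level 1: students)} \\
\pi_{pjk} &= \beta_{p0k} + \beta_{p1k} W_{jk} + r_{pjk}, \quad p = 0, 1
  && \text{(level 2: classrooms)} \\
\beta_{pqk} &= \gamma_{pq0} + \gamma_{pq1} Z_k + u_{pqk}, \quad q = 0, 1
  && \text{(level 3: schools)}
\end{aligned}
```

Each coefficient of one level becomes an outcome at the next level up, with its own error term, in the same way that the microlevel coefficients become outcomes of the macro equations in the two-level model.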

A three-level model is also useful in the analysis of multilevel longitudinal 
data, i.e. multiple observations of micro-units within a set of macro-contexts. 
Bryk & Raudenbush (1987) proposed one such model to explain human de- 
velopment in an educational setting. In their model, the variables at the first 
level are time-varying individual-level variables, most notably, time itself. The 
second level is also specified at the micro level: it contains time-invariant 
individual effects, which are used to explain the first-level coefficients of the 
time-varying variables. The third level contains macro variables, which in Bryk 
& Raudenbush’s model were attributes of schools. Willms & Raudenbush 
(1989) proposed a different three-level model for longitudinal analysis. Their 
first level model contains student-level variables, measured at a particular time. 
The second level equation models the coefficients of the first-level equation 


4See also Clogg et al (1991) and Stasny (1991) for related uses of shrinkage estimators to stabilize 
within-group estimates when sample sizes are small. Also see the related discussion of empirical 
Bayes meta-analysis (Braun 1989, Bryk & Raudenbush 1992). Here the various studies function as 
alternative “contexts.” 
as functions of school characteristics, which are measured contemporaneously
with the individual-level measures in the first-level equation. These school 
characteristics at the second level are expressed as deviations from the average 
effects for the school over time. The third level model then specifies the 
coefficients of these time-varying school variables as functions of school 
characteristics averaged over time. This model is designed to distinguish 
the effects of short-term and long-term characteristics of schools on student 
achievement. 

Nonnested models have also been developed for sociological problems. One 
type of nonnested model arises when two macrolevel contexts are presumed 
to affect the same microlevel unit at the same time. By crossing the two 
dimensions along which these multiple contexts are defined, one obtains a 
larger number of multidimensional contexts, such that each micro unit is nested 
within a single multidimensional context. With this modification, the process 
can be studied with the usual two-level multilevel model (Braun 1989). 
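
The crossing step amounts to building a composite context identifier from the two original ones. The following Python sketch assumes records stored as dictionaries. The field names "school" and "neighborhood" and the sample records are hypothetical.

```python
from collections import defaultdict


def cross_contexts(records, first_key, second_key):
    """Group micro units by the combination of two context identifiers.

    Each distinct (first, second) pair becomes one multidimensional context,
    so every record falls into exactly one cell.
    """
    cells = defaultdict(list)
    for record in records:
        cells[(record[first_key], record[second_key])].append(record)
    return dict(cells)


if __name__ == "__main__":
    students = [
        {"id": 1, "school": "A", "neighborhood": "north"},
        {"id": 2, "school": "A", "neighborhood": "south"},
        {"id": 3, "school": "B", "neighborhood": "north"},
        {"id": 4, "school": "A", "neighborhood": "north"},
    ]
    for cell, members in cross_contexts(students, "school", "neighborhood").items():
        print(cell, [m["id"] for m in members])
```

One practical consequence of crossing is that the number of contexts grows while the number of micro units per context shrinks, which can make within-context estimates less precise.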

A second type of nonnested model can be used to allow contextual effects 
to be heterogeneous within contexts. The usual multilevel model assumes that 
the effects of context are homogeneous for all units located in the same context. 
As Blalock (1984) noted, this assumption of homogeneity may often be un- 
warranted.5 As a remedy, Blalock suggested that actors be divided into groups 
that have homogeneous "attachments, dependencies, and vulnerabilities." 
DiPrete & Grusky (1990a,b) employed a similar strategy when they developed 
a three-level nonnested model to overcome the homogeneity problem in their 
analysis of temporal variation in the coefficients of the basic status attainment 
equation. Their three-level model used data from repeated cross-sectional 
samples of the American population. The first-level equation was a basic 
microlevel status attainment equation, with individual-level variables mea- 
sured at the different time points available in the repeated cross-sections. The 
second level expressed the coefficients of these individual-level variables as 
functions of lagged macro factors. The third level permitted the strength of 
the macro factors on individual processes to vary across individuals. DiPrete 
& Grusky used an individual-level variable—a measure of labor force expe- 
rience—to capture the differential vulnerabilities of workers to macro pro- 
cesses, but in principle these vulnerabilities could have been specified as a 
function of other macro variables such as firm size, labor market sector, etc. 

An alternative elaboration of the basic multilevel model involves relaxing 
the assumption that the dependent variable is a continuous, normally distrib- 
uted, random variable. Wong & Mason (1985) developed a hierarchical logistic 
regression model for multilevel analysis. Goldstein (1987) developed a hierarchical
loglinear model. Other such work is ongoing (e.g. Kreft et al 1990, 
Goldstein 1991, Breslow & Clayton 1993). 

5 In a school context, for example, Willms (1986) speculated that the impact of school
composition on various outcomes might be a function of the initial ability of the pupil. 


OTHER FORMS OF MULTILEVEL MODELS 


Most of the discussion to this point has concerned the analysis of data for 
multiple contexts, where several microlevel data points are available for each 
context. This type of data allows a variance components analysis that can 
separate the within-context and between-context contributions to total vari- 
ance. In some cases, however, researchers may develop a multilevel theory 
and collect multilevel data, but cannot use variance components multilevel 
models because the researchers do not have measurements for multiple micro 
units within each macro unit. A notable case in point is found in contemporary 
research in stratification and work. A central theme in the recent literature is 
the influence of work organization and labor market segment on stratification 
outcomes. Several scholars (Villemez & Bridges 1988, Parcel et al 1991, 
Tomaskovic-Devey et al 1993) have collected the names of employers from 
individual survey respondents to generate a sample of organizations from 
which additional information was collected. The combination of the individual 
and the employer surveys is a multilevel database on workers and their work 
context. The most recent data of this form comes from the National Organi- 
zation Survey (NOS), which combines microlevel data from the General Social 
Survey with macro data about the employing organizations of GSS sample 
members and their employed spouses. Unfortunately, because these datasets 
typically contain information on only one microlevel unit within each macro- 
level unit, it is not possible to estimate the variance components multilevel 
models described above, and corrections for the heteroskedasticity in random- 
effects multilevel models fit to these data must follow the modeling strategies 
discussed, for example, by Maddala (1977) or Muthén & Satorra (1989). 
Analyses with data such as the NOS can specify microlevel coefficients as 
functions of macrolevel variables, but they cannot estimate the "total effects" 
of context because only one micro observation is available per organization. 
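The variance decomposition referred to at the start of this section, and the reason it breaks down with one micro unit per context, can be illustrated with a small sketch. The firm labels and outcome values are invented for illustration, and the function `variance_components` is a hypothetical helper that computes only sums of squares, not the full estimators discussed earlier.

```python
from statistics import mean


def variance_components(groups):
    """Return (between_ss, within_ss, within_df) for context -> outcomes."""
    all_values = [y for ys in groups.values() for y in ys]
    grand_mean = mean(all_values)
    # Between-context sum of squares: context means around the grand mean.
    between_ss = sum(
        len(ys) * (mean(ys) - grand_mean) ** 2 for ys in groups.values()
    )
    # Within-context sum of squares: units around their own context mean.
    within_ss = sum(
        (y - mean(ys)) ** 2 for ys in groups.values() for y in ys
    )
    # Degrees of freedom left for estimating within-context variance.
    within_df = len(all_values) - len(groups)
    return between_ss, within_ss, within_df


# Several workers per firm: both components are identified.
multi = {"firm_1": [10.0, 12.0], "firm_2": [20.0, 22.0]}
multi_result = variance_components(multi)

# One worker per firm: no degrees of freedom remain within contexts.
single = {"firm_1": [10.0], "firm_2": [20.0]}
single_result = variance_components(single)
```

With one worker per firm, the within-context sum of squares is zero by construction and has zero degrees of freedom, so the within and between contributions cannot be separated.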

Another important modeling strategy in stratification research, sibling models,
in effect does the reverse. Hauser & Mossel (1985; see also Hauser & 
Sewell 1986, Hauser & Wong 1989) note that their sibling model is a specification
for the contextual effects of the family of origin on stratification 
outcomes; in fact, it can be conceptualized as a random effects model for the 
intercept of a regression of status on schooling. Unlike the variance compo- 
nents models discussed earlier or the models for the effects of work organiza- 
tion just discussed, Hauser et al's sibling model was not designed to provide 
a substantive explanation for varying coefficients of individual-level variables 
(e.g. schooling, motivation, mental ability) as a function of family-level variables
(e.g. father's occupational status, mother's education). Rather, it removes 
the error component in the status attainment equation that arises from unmeasured
family effects, which, because they are correlated with schooling, normally
create an upward bias in estimates of the effects of schooling on outcomes.6
Models of sibling resemblance have not yet been extended to consider 
family-based variation in other coefficients of the basic status attainment 
equation.7

A different multilevel modeling strategy has been employed in the 
analysis of mobility data arranged in contingency tables. As Iversen (1991) 
notes, some of the earliest contextual models were formulated as cross-classi- 
fications of microlevel and macrolevel variables (Kendall & Lazarsfeld 1950, 
Berelson et al 1954, Davis et al 1961). The more recent work goes beyond 
these early efforts and follows the same basic strategy used in multilevel linear 
regression models. In a single context (usually country), the mobility structure 
is reflected in the parameters of a loglinear or logmultiplicative model. Grusky 
& Hauser (1984) combined data for several countries and modeled the variation 
in parameters for each country's model as a function of country characteristics 
such as political structure and level of economic development (see also Hauser 
& Grusky 1988, Erikson & Goldthorpe 1992). These models are fixed-effects 
models. Random-effects multilevel loglinear or logmultiplicative models have 
not been applied to sociological research to our knowledge.

Another related multilevel model is found in event-history analysis. Event-history models are 
related to the life-course analyses mentioned earlier (e.g. Willms & Rauden- 
bush 1989, Raudenbush & Bryk 1989), though time is measured continuously 
instead of discretely, and the outcome variable is discrete instead of continuous 
in event history models. Scholars have used multilevel event-history models 
to specify community effects in outcomes such as the timing of marriage or 
fertility (e.g. Billy & Moore 1992, Lichter et al 1992). Except in the case of 
models for unmeasured heterogeneity (discussed below), almost all the work 
to date has employed fixed-effects versions of these models, where interactions 
between macro and micro variables are added, but no adjustment is made for 
a possibly more complex error structure. Little work has been done with 
event-history models on the analysis of variance components or in the use of 
EB methods to obtain more stable estimates of context-specific event-history 
models (though see Braun 1989 for some discussion of these issues).

6 However, the downward bias on the schooling coefficient from measurement error appears to 
be roughly the same size as the upward bias on the schooling coefficient from omitted family 
variables (Hauser & Mossel 1985). 

7 Searches for interactions between family characteristics and the schooling coefficient with
conventional regression techniques have found little of interest to date (Jencks et al 1971, Hauser 1973). 

Endogenous switching regression models can also be viewed as a form of multilevel 
model (Greene 1990; for applications see, e.g. Dickens & Lang 1988, Gamoran 
& Mare 1989). Researchers using this strategy specify separate regression 
models for each of a small number of distinct contexts (e.g. the primary or 
secondary sector of an economy, or the different educational tracks in high 
school), and they also specify a model that explains the observed distribution 
of micro units across the contexts. Because the number of contexts is small, 
macro variables cannot be introduced to provide quantitative explanations of 
between-context differences in regression coefficients.8 

Latent structure models are related in some ways to multilevel models. A 
latent structure model can be thought of as a type of multilevel model in which 
associations between microlevel variables are “explained” by the uncovering 
of latent classes which arguably generate the microlevel associations (Good- 
man 1974, Clogg 1981, McCutcheon 1987). A latent structure model is anal- 
ogous to a regression model in which the measured covariates are concep- 
tualized as indicators of latent contexts, without “true” direct effects of their 
own. The intercepts of the microlevel models are specified to vary by latent 
context. A computer search is done in order to identify latent contexts that 
minimize the measured direct effects of the covariates, and maximize the 
variation explained by the varying intercepts. The latent classes cannot be 
interpreted as pure contextual effects, however, because they will generally 
include individual-level common factors as well. 

Unmeasured heterogeneity models in event history analysis are also related 
to multilevel models (Heckman & Singer 1984, Manton et al 1992). The 
presence of duration dependence in event-history models is often interpreted 
to mean that units in the analysis differ in ways not captured by the measured 
covariates. These unmeasured factors will bias estimates of duration depen- 
dence, and correlations between these unmeasured factors and measured cov- 
ariates will bias the estimated coefficients of the measured covariates. The 
solution increasingly used in the analysis of event-history models is in effect 
to specify the intercept of event-history models as a random function of 
unmeasured person effects, then to specify a parametric or mixing distribution 
for these unmeasured effects, and then to incorporate this mixing distribution 
into the likelihood function. However, the mixing distribution is usually not 
given a substantive interpretation, and it will generally be a function of micro 
as well as macro factors. 
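The three steps just listed (random intercept, mixing distribution, integration into the likelihood) can be written compactly. The following is a sketch in standard notation under an assumed proportional-hazards form; the symbols $h_0$, $\theta_i$, $G$, and the event indicator $d_i$ are introduced here for illustration and are not taken from the works cited.

```latex
\begin{aligned}
h(t \mid x_i, \theta_i) &= h_0(t)\,\exp\!\left(x_i'\beta + \theta_i\right),
  \qquad \theta_i \sim G, \\
S(t \mid x_i, \theta) &= \exp\!\left(-\int_0^{t} h(s \mid x_i, \theta)\, ds\right), \\
L_i(\beta) &= \int h(t_i \mid x_i, \theta)^{d_i}\; S(t_i \mid x_i, \theta)\; dG(\theta).
\end{aligned}
```

Here $t_i$ is the observed duration and $d_i = 1$ if the event occurred ($d_i = 0$ if censored). The person effect $\theta_i$ is integrated out rather than estimated, which is why $G$ absorbs whatever unmeasured micro and macro factors shift the intercept and usually carries no substantive interpretation.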

8 Note, however, that through a reconceptualization, it may be possible to expand the number of 
contexts in order to introduce macro-level regressors. For example, Gamoran & Mare (1989) 
performed an endogenous switching regression analysis of the effects of tracking on achievement 
in which students from the High School and Beyond Survey were classified in either the college or 
the noncollege track. Gamoran later (1992) reanalyzed High School and Beyond data using school 
as the context, and the track structure of the school as a macro variable explaining variation in 
between-school outcomes. 

Perhaps the use of latent variables that is closest to multilevel analysis is 
when scholars specify parallel covariance structure models for several distinct 
contexts (e.g. Kohn et al 1990). In the work by Kohn and associates, cross- 
national variation in model coefficients is given a multilevel interpretation, but 
the variations are not quantitatively modeled as a function of macro variables. 
Of course, when the number of macro contexts is small—as is the case when 
a small number of countries are compared—it is not possible to model variation 
in micro parameters as a function of macro variables, other than to test for 
coefficient differences. Significant variation in the microlevel coefficients can 
of course be attributed to various attributes of the macro contexts, but confi- 
dence in the validity of such assertions must be gained by identifying other 
factors (which could be macro variables or coefficients of other micro equa- 
tions) that vary with these attributes in ways predicted by theory. 

Related issues exist in the growing literature in comparative-historical so- 
ciology. Comparative-historical scholars have paid a great deal of attention to 
the distinction between "small-N" and "large-N" studies (Ragin 1987), but 
they have not yet studied in detail the benefits from combining "medium-size 
N" macro and "large-N" micro studies. One explanation for this omission is 
that comparative-historical researchers as a rule prefer "rich" complex com- 
parisons that require them to put a great deal of effort in a small number of 
cases. Time and money constraints limit their ability to include the number of 
macro cases required for statistical analysis. In addition, they have not yet 
pursued in sufficient depth the possibilities that even "small-N-macro"/"large- 
N-micro" studies can afford, of the form used by Kohn and associates. The 
number of such studies is beginning to grow, though they differ from the 
statistical models discussed above in that cross-context comparisons of within- 
context results are often done in a qualitative rather than a quantitative way 
(Janoski 1991; see also Hage et al 1989, Flora et al 1988a,b). 

Lastly, there have been some nascent efforts to generate models that explain 
macro processes or outcomes as a function of micro-level processes and 
outcomes. While the importance of using microlevel processes to explain 
macrolevel outcomes has long been recognized in the social sciences (Merton 
1936, Olson 1965, Schelling 1978), relatively little research has attempted to 
specify models that characterize or explain processes at the macro level by 
using microlevel data. Coleman & Hao (1989) propose one such method by 
looking at individual-level exchanges to characterize the exchange relationships
in a system.9 This model differs from more conventional multilevel 
regression models, however, in that it employs no macro characteristics, either
as independent or as dependent variables, other than those revealed 
through data analysis of micro-level relationships among actors (e.g. the 
"price" of items being exchanged). Models that use microlevel relationships 
to explain characteristics of the system other than those that emerge from the 
model itself are still underdeveloped. 

9 For an example of a network analysis that looks instead at how macro network level properties 
affect micro behavior, see Markovsky et al (1988). 


SUBSTANTIVE RESEARCH WITH MULTILEVEL MODELS 


The methodological developments discussed above have been used to study a 
variety of substantive problems, many of which have been alluded to. In the 
space available, we can only briefly illustrate the utility of multilevel models 
for substantive research. 

In the area of demography, random-effects multilevel models have, to date, 
been used primarily to analyze the additive and interactive effects of commu- 
nity-level characteristics on the reproductive behavior of women in developing 
countries (Casterline 1985). Entwisle & Mason (1985) used multilevel models 
to study the connection between socioeconomic status and fertility. They found 
that, while socioeconomic factors affect the number of children ever born, the 
magnitude of these effects depends on a country's socioeconomic development 
and on the strength of its family planning programs. Countries at a low level 
of socioeconomic development and with no family planning programs have 
positive micro socioeconomic fertility differentials. Countries at a high level 
of socioeconomic development have strong negative individual-level socio- 
economic differentials when family planning programs are weak, but weaker 
individual-level socioeconomic differentials when family planning programs 
are stronger. In a study of contraceptive use, Entwisle et al (1986) find that 
while individual socioeconomic effects on contraceptive use are strong, coun- 
try-level effects are weak, in contrast to some earlier studies (see also Entwisle 
et al 1989). 

Some recent efforts have also been made to examine the main effects of 
various community-level factors on the reproductive and marital behavior of 
young adults in the United States. Hogan & Kitagawa (1985) and Hogan et al 
(1985) found that growing up in a poor, highly segregated community de- 
creases a black female's chances of using contraception at first intercourse and 
significantly increases her risk of having a premarital pregnancy. Crane (1991) 
recently found that the likelihood of having a birth among young black and 
white females is inversely related to the quality of local neighborhoods, as 
measured by the proportion of all workers in the community who were em- 
ployed as professionals and managers. Finally, Billy & Moore (1992) found 
that the proportion of women working full-time, the proportion of white collar 
workers, and the proportion of females who are currently separated or divorced 
in the community are all inversely related to the risk of experiencing a birth 
among married nonblack women, while the female unemployment rate, the 
sex ratio of the never married population, and the child/woman ratio for women 
aged 15 to 24 are positively associated with the risk of experiencing a birth 
for unmarried nonblack women. 

The recent study by Lichter et al (1992) is the first to use multilevel models 
to examine the main effects of marriage market factors on first marital transi- 
tions of young black and white women in the United States. These authors 
found that while differences between blacks and whites in marriage market 
opportunities are quite large, such differences are unable to account for racial 
differences in the timing of first marriage. Three additional studies, which 
examine the impact of factors such as the proportion of female headed families, 
the proportion of adults with a college degree, or the divorce rate, generally 
support Lichter et al's conclusion (Hoffman et al 1991, Li 1992, Forristal 
unpublished). Studies of community factors on rates of marriage and fertility 
have so far largely focused on the main effects of context; studies that examine 
whether macro variables affect the values of microlevel coefficients other than 
the microlevel intercept remain to be done. 

The literature on school effects is voluminous. Coleman et al (1982) found 
that SES effects on test scores are flattened in Catholic schools compared with 
public schools, and Lee & Bryk (1989) extended their research by using 
random effects multilevel models. Their research also suggests that the effects 
of race are reduced in orderly schools, while the effects of class and academic 
background are reduced in smaller schools, in schools where the math curric- 
ulum is more homogeneous, and in schools where discipline procedures are 
fair and effective. Willms & Raudenbush (1989), in their analysis of the 
stability of the effects of school composition and policy on student achievement 
over time, found that variation in school effects for a given school tends to be 
small relative to the variation in school effects across schools. Other studies 
have looked at the strength of local labor market effects on decisions to remain 
in school (Raffe & Willms 1989), and sex differences and school effects in 
the growth in mathematics skills in lower and middle schools (Willms & 
Jacobsen 1990). Still other studies have examined the effects of placement 
within educational tracks, or attendance at schools that use educational tracks 
on achievement outcomes (e.g. Gamoran & Mare 1989, Gamoran 1992, Kerck- 
hoff 1993). A complete review of the multilevel educational research would 
require an entire review paper (see Raudenbush & Bryk 1986, Goldstein 1987, 
Bryk & Raudenbush 1988, Bock 1989, Willms & Jacobsen 1990, Rumberger 
& Willms 1991, or Crane 1991, for examples of related research). 

In the area of stratification, Grusky & Hauser (1984) examined the impact 
of societal attributes in explaining cross-national variation in social mobility. 
They found reasonably strong though complex effects of economic develop- 
ment on social mobility, and they found that the effects of political organization 
were equally strong. Treiman & Yip (1989) studied how macro factors might 
explain cross-national variation in the coefficients of the basic status attainment 
equation. They found that industrialized societies tend to be more open than 
developing societies, and that societies which are more industrialized and 
which have more status equality show stronger effects of education, and weaker 
effects of family background on status attainment. Erikson & Goldthorpe 
(1992), in contrast, concluded that cross-national variation in social mobility 
is largely unsystematic once differences in occupational distributions are con- 
trolled. At a more micro contextual level, research into neighborhood effects 
on poverty, especially in the United States, found significant negative effects 
of coming from a welfare-dependent community or from a highly segregated 
community, but many effects of neighborhood context on economic outcomes 
appear weak (Datcher 1982, Mayer & Jencks 1989, Tienda 1991, Garner & 
Raudenbush 1991, Massey et al 1992, Corcoran et al 1993). 

Meanwhile, other scholars have examined how social change within a given 
society can change the process of stratification. DiPrete & Grusky (1990b) 
found that the gradual growth of bureaucratic personnel policies in the United 
States may have played a role in equalizing opportunity, but that direct effects 
of political intervention may have been more important sources of change. At 
the third level of their multilevel model, they showed that—consistent with 
prevalent theories about how labor markets operate—the magnitude of these 
changes was stronger for new entrants into the labor market than for experi- 
enced workers. In addition, many studies have analyzed trends in the coeffi- 
cients of stratification models without necessarily introducing macro variables 
to explain these trends (Hauser & Featherman 1978, Mare 1981, Hout 1988, 
Grusky & DiPrete 1990, Blossfeld & Shavit 1993). 

A huge literature examines the impact of organizational structure and labor 
market segment on career outcomes (Baron 1984, Rosenfeld 1992), usually 
with data from specific firms or with data about firms and labor markets 
collected from individuals. But, as noted above, some recent research uses 
multilevel data that was collected at both the individual and the firm level. For 
example, Villemez & Bridges (1988) have explored the influence of firm size 
and other organizational characteristics on earnings, and how these effects vary 
by gender and occupation. Published research based on the new National 
Organization Study is expected in the near future. 

In the area of criminology, several scholars have examined the impact of 
contextual factors on individual risks of victimization (Sampson et al 1987, 
Smith & Jarjoura 1989, Kennedy & Forde 1990, Miethe & McDowall 1993; 
PW Rountree, KC Land, TD Miethe, unpublished ms). For example, Miethe 
& McDowall (1993) showed that community measures such as the extent of 
family disruption, density of ownership of various consumer goods, racial 
heterogeneity, median income, proportion living alone, unemployment rates, 
and divorce rates influence risks of victimization. Aside from their finding 
that individual safety precautions against burglary had a greater impact in 
more affluent neighborhoods than in poorer or commercial neighborhoods, 
community characteristics appeared to have only weak effects on the rela- 
tionship between microlevel variables and victimization, though Rountree et 
al found evidence that interactions between community and individual-level 
variables are larger than would appear from estimation of a fixed effects 
model.10 

Multilevel research is found in many other areas of sociology and related 
fields aside from what is covered in our illustrative discussion above. The main 
purpose of the above discussion is to present evidence that, for some socio- 
logically important outcomes, context matters. It is probably a fair generaliza- 
tion to state that most such evidence involves the main effects of context on 
behavior, though important research has obviously also been done in studying 
the interaction structure, too. The evidence certainly does not justify the con- 
clusion that context always matters when theory suggests it might, nor that its 
effects, net of other factors, will always be substantively important (Mason 
1991). Furthermore, the above discussion does not address the underlying 
theoretical questions concerning the correct interpretation of measured con- 
textual effects (assuming these measured effects are, in fact, “real” and not the 
product of faulty estimation). 

With sociology still limited to “middle range” theorizing (Merton 1968), 
there is no general theory of multilevel interrelationships. Theoretical research 
in different substantive areas, however, is proceeding toward the goal of 
constructing plausible explanations for what context means and how it might 
affect individual behavior, while empirical research tests these ideas with 
multilevel data. The amount of multilevel research in sociology should grow 
rapidly as theories are refined, as already-researched problems are revisited 
with newer methods, and as new data allow the testing of new hypotheses on 
the relationship between macro and micro. 


THE NEED FOR CAUTION 


Despite the advantages of the new techniques for multilevel analysis, one must 
be careful in applying these techniques to substantive problems. The new 
techniques, while superior to the older ones, rely on assumptions that often 
will not hold in specific substantive contexts.11 Some of the potential problems 
are: 


10 Note that the interaction effects are not necessarily increased in size when a random-effects 
model is used. Cf Kreft & de Leeuw (1988). 

11 Partially overlapping lists of the potential difficulties raised by multilevel modeling can be 
found in Blalock (1984), Kreft et al (1990), and Mason (1991). 

[1] Model complexity. The estimation of the coefficients of random-effects 
multilevel models is not trivial, and the sociological imagination can easily 
outrun the capacity of the data, the computer, and current optimization tech- 
niques to provide robust estimates. Kreft et al's observation in this regard is 
trenchant: "Investigators (if the past is any indication) will tend to choose 
models that are too complicated (five levels, with 10 variables on each level). 
This leads to impossibly difficult search problems over the space of models 
and to impossibly difficult likelihood maximization problems" (Kreft et al 
1990, p. 99). 

[2] The assumption of fixed regressors. The statistical theory underlying 
random-effects multilevel models assumes that the regressors are fixed and 
estimates the models conditional on their values. But in most practical appli- 
cations, the regressors are random, and unconditional estimation is desired 
(Kreft et al 1990). 

[3] The problem of missing data. While increasingly sophisticated proce- 
dures are now available for handling missing data in statistical analysis (Little 
& Rubin 1987), existing software for random-effects multilevel regression 
models does not provide for internal (to the software package) treatment of 
the missing data problem. Given the growing evidence that naive treatment of 
the missing data problem produces biased estimates (Wang et al 1992), and 
given the amount of computation required to generate random-effects multi- 
level regression estimates, it is quite possible that an analyst who is faced both 
with missing data problems and with constraints on computing would do better 
by combining sophisticated methods for handling missing data with simple 
fixed effects multilevel regression methods than by combining naive methods 
for handling missing data with sophisticated random-effects multilevel regres- 
sion methods. The former strategy would yield consistent estimates of the 
regression parameters, though biased estimates of standard errors, but the latter 
strategy might produce inconsistent estimates of both the regression parameters 
and their standard errors.12 

[4] Correlation between micro variables and errors in the macro equation. 
As Equation 6 shows, the error in the macro equation is a component of the 
error in the micro equation, too. If this error is correlated with microlevel 
regressors, the estimates of the microlevel coefficients will be biased. In this 
situation, the use of dummy variables to convert a random-effects multilevel 
regression model to a fixed effects multilevel regression model may be advised 
(Judge et al 1985, Greene 1990). 

12 See Muthén (1993) for a discussion of recent progress in the treatment of missing data in the 
context of a random effects multilevel model. 

[5] Normally distributed errors. Existing statistical packages generally assume
normally distributed errors. Little is known about the robustness of this 
assumption. If the microlevel errors are normal or can be made normal through 
a suitable transformation, nonnormality in the macro equation can be addressed 
as in [4], through the use of dummy variables to convert from a random-effects 
to a fixed-effects multilevel regression problem. 

[6] Correlation between macro variables and errors in the macro equation, 
or between either macro or micro variables and errors in the micro equation. 
This problem bedevils all research that employs nonexperimental data to make 
causal inferences (e.g. Holland 1986, Berk 1988; see also Cook & Campbell 
1979, Blalock 1984, Bryk & Raudenbush 1992). If the errors in the macro 
equation are correlated with the measured macro-level variables, the macro- 
level coefficients are biased, and a causal interpretation may be inaccurate. 
Similarly, if the error in the microlevel equation is correlated either with 
microlevel or macrolevel variables, the estimates will be inconsistent. 

There are various reasons why such correlations may arise. Hauser (1970, 
1974) discussed the case where individual-level variables are poorly measured, 
such that macrolevel variables function as proxies for microlevel variables in 
a regression. For example, average SES in a school might be a proxy for 
individual SES if the latter variable is measured with error. Another problem 
occurs when characteristics of measurement error vary by context. For exam- 
ple, if measurement error for SES was greater in Catholic than in public school 
samples, or if the true variance of SES in Catholic schools was restricted 
relative to public schools while the error variance was the same in the two 
contexts, the measured effect of SES on achievement in Catholic schools might 
be smaller than in public schools when the true effects are the same size (RM 
Hauser, personal communication). In all of these examples, what appears to 
be a multilevel effect may be an artifact of the failure to correct for measure- 
ment error in the microlevel model. 
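The Catholic versus public school example can be made concrete with a little arithmetic. The sketch below assumes classical measurement error in a bivariate regression (error uncorrelated with true SES and with the outcome disturbance), in which case the expected slope is the true slope multiplied by the reliability ratio. The variances are invented numbers, and `observed_slope` is a hypothetical helper, not a function from any package.

```python
def observed_slope(true_slope, true_var, error_var):
    """Expected OLS slope when the regressor has classical measurement error."""
    # Reliability: share of observed variance that is true variance.
    reliability = true_var / (true_var + error_var)
    return true_slope * reliability


# Same true effect of SES and same error variance in both sectors,
# but true SES variance is restricted in the (hypothetical) Catholic sample.
public = observed_slope(true_slope=1.0, true_var=4.0, error_var=1.0)
catholic = observed_slope(true_slope=1.0, true_var=1.0, error_var=1.0)
```

Under these assumed numbers the measured SES effect is 0.8 in public schools and 0.5 in Catholic schools, even though the true effect is 1.0 in both. The apparent cross-level interaction comes entirely from the difference in reliability.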

Another situation where such correlations could arise is when the macro-level 
variables are in some sense endogenous to the microlevel process (Blalock 
1984). To take an example from the published controversy over Coleman et al's 
report on Public and Non Public Schools (1982), if parents of children who—for 
unmeasured reasons—are high achievers seek out Catholic schools, the correla- 
tion between the individual-level errors and the school type variable would lead 
to bias in the effect of school type in the macro equations (e.g. Goldberger & Cain 
1982). As with the problem of measurement error, selectivity bias is not 
eliminated by the use of more sophisticated multilevel procedures. Combining a 
sophisticated treatment of measurement error, selectivity bias, and related 
problems with the use of simpler fixed effects multilevel regression models may 
often produce more accurate results than a strategy which combines naive 
treatment of these issues with random-effects multilevel modeling. 

[7] The assumption of exchangeability. An assumption underlying the use of 
empirical Bayes procedures to increase the reliability of within-context estimates
is that the different contexts be "exchangeable" (Lindley & Smith 1972). 
In the absence of explicit covariates in the macro equation, this amounts to 
assuming that the microlevel coefficients differ across contexts only by chance 
variation. In the presence of explicit covariates in the macro equation, this 
amounts to assuming that the residual variation in the macro equations is 
unsystematic. If the contexts are exchangeable, then the researcher can improve 
the precision of within-context coefficient estimates by "borrowing" informa- 
tion from other contexts. However, if the contexts are not exchangeable—in 
other words, if the distinctiveness of a given context is not fully captured by 
the macrolevel variables—then the "shrinkage" estimator for the context-spe- 
cific coefficients may be biased (McCullagh 1989, Bryk & Raudenbush 1992). 
Since we do not know how bad the bias is, we do not know (e.g. in a 
mean-square error sense) whether the tradeoff is worthwhile for any particular 
set of within-context estimates. Furthermore, the bias-variance tradeoff may 
differ for each context in the model. 
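
The tradeoff can be sketched for a single context under simplifying assumptions: the macro-equation prediction (gamma) and the between-context variance (tau2) are treated as known, and the within-context OLS estimate is unbiased with sampling variance v_j. The function and all numbers below are hypothetical illustrations, not results from any study.

```python
def shrinkage_mse(beta_j, gamma, tau2, v_j):
    """Weight, bias, and mean-square error of a shrinkage estimate for one context.

    The estimate is lam * b_j + (1 - lam) * gamma, where b_j is the unbiased
    within-context estimate of the true coefficient beta_j.
    """
    lam = tau2 / (tau2 + v_j)            # weight placed on the context's own estimate
    bias = (1 - lam) * (gamma - beta_j)  # pull toward gamma, away from the truth
    variance = lam ** 2 * v_j
    return lam, bias, variance + bias ** 2


GAMMA, TAU2, V_J = 0.5, 0.04, 0.09       # hypothetical values
for beta_j in (0.5, 0.7, 1.5):
    lam, bias, mse = shrinkage_mse(beta_j, GAMMA, TAU2, V_J)
    print(f"beta_j={beta_j}: weight={lam:.3f} bias={bias:+.3f} "
          f"MSE={mse:.4f} (unshrunk MSE={V_J})")
```

With these assumed values, shrinkage lowers mean-square error for contexts whose true coefficient lies near the macro prediction and raises it for a strongly distinctive context. Because the true coefficient is unknown in practice, the analyst cannot tell which case applies.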

The failure of exchangeability may be viewed as a problem of data collection 
at the macro level, or it may be viewed as a problem of sample size at the 
macro level. If the number of factors that make contexts distinctive is large 
relative to the number of contexts, then the ability of the analyst to model these 
variations is obviously limited. Arguably, this situation is more of a problem 
for the analysis of complex, unique entities such as countries than it is for 
more standardized organizations such as schools. When analyzing a set of 
unique contexts, the justification for performing random-effects regression 
models may be weak, and a fixed-effects dummy-variable model may be more 
appropriate (Greene 1990). Unfortunately, there is no easy answer to the 
question of when a fixed or a random effects model is more appropriate. 


DIRECTIONS FOR FUTURE RESEARCH 


The rate of development in the methodology of multilevel models and in their 
substantive application to a variety of sociological problems has been enor- 
mous in the past decade. This rate is likely to continue in the near future (e.g. 
Hox & Kreft 1994). A variety of methodological problems await solutions. At 
the mundane level, there is the prospect of faster computational algorithms 
and more capable hardware that will allow users to analyze more complex 
problems. We can also hope for a closer linkage between the literature on 
multilevel estimation and multilevel study design. Often a tradeoff exists 
between the number of microlevel observations that can be collected within 
each context and the number of contexts that can be sampled. Users await 
useful guidance about how they should allocate their data collection resources. 

We should also expect the development of more general multilevel models 
and software with which to estimate them. One such development would be 
the extension of the random-effects multilevel regression model to the case of
event-history analysis; while another would be a similar extension for loglinear
and related models for contingency tables. Because many common event 
history models and models for the analysis of contingency tables are general- 
ized linear models (McCullagh & Nelder 1989), developments of multilevel 
statistical theory for this more general case (along with appropriate computer 
software) would have wide applicability in sociology. Generalizations of the 
generalized linear model to include random terms in the linear predictor are 
called generalized linear mixed models (GLMMs), and recent work (e.g.
Goldstein 1991, Breslow & Clayton 1993) on approximate methods of
inference in GLMMs may lead to improved practical approaches to the esti- 
mation of multilevel event history models, multilevel loglinear and logmulti- 
plicative models, and other related multilevel models in the near future. 

A second important advance would be the development of multilevel
models for systems of microlevel and macrolevel endogenous variables. The 
current techniques can handle only a single microlevel dependent variable. 
Research is currently underway to create a "path analysis" for multilevel 
research (Kreft et al 1990, p. 23). 

A third hoped-for development would be the extension of latent variable
methods so that they could be applied to multilevel problems. One welcome 
advance would involve the case where the microlevel variables are latent, 
while the contexts and macrolevel variables are measured exactly. As Muthén 
& Satorra noted (1989), estimation of such a model requires both further 
statistical development and new statistical software (see also Muthén 1994). 

A more ambitious latent variable model would allow the macro contexts to 
be latent as well. The use of latent contexts would obviously require that the 
macro variables for these contexts also be latent, since the boundaries of 
observable "contexts" would not exactly correspond to the boundaries of latent 
contexts. Such a model might be viewed as an extension of latent class analysis. 
Instead of searching for latent classes that explain variation in some outcome 
variable by reducing the partial association (conditional on the latent class) 
between measured covariates and this outcome variable to a minimum, an 
extended latent class model might be employed to explain remaining variation 
in the effects of covariates as a function of properties of the latent classes, 
modeled as a function of macro indicator variables. 

For example, several scholars in the 1970s and 1980s employed notions of a 
dual economy or a segmented labor market in attempts to explain variation in 
stratification outcomes (Hodson & Kaufman 1982). However, these structures 
were not precisely measured; often it was not obvious which segment any given 
worker belonged to. This fuzziness did not prevent scholars from proposing 
variables that were indicators of market segment (e.g. measures of technology, 
firm size, organizational and industry attributes, or occupational characteristics).
In a latent multilevel model, market segments might be latent contexts that are
explained by and indicated by firm, occupational, and industrial characteristics.
The latent properties of these latent contexts would explain coefficient variation
in microlevel equations for stratification outcomes. Given the formidable
computational difficulties that already exist for current versions of multilevel 
models, it is hard to say what the prospects are for the development of statistical 
theory and useful algorithms for such multilevel latent contextual models. 
Clearly, however, there is room for further advancement. 

The existence of latent multilevel models might also create a unified frame- 
work for the interpretation of the coefficients of ordinary microlevel models. 
Suppose an analyst has constructed a model for attitudes toward particular 
government policies that includes measures of education and income, and that 
is estimated with OLS regression. Suppose we then consider the interpretive 
and statistical difficulties raised by the coefficient for income. What does it 
mean? From a substantive perspective, interpretations of the income effect are 
often contextual. Income has an effect not only because one's interests are 
related to the amount of money one has in the bank, but also because income 
is associated with patterns of interpersonal association, of upbringing, and of 
exposure to mass media. Ideally, one would like to identify the characteristics 
of this socio-economic-cultural context, and use its attributes to explain the 
effects of individual-level variables on outcomes. From this perspective, vir- 
tually all sociological models become contextual, with micro and macro 
variables defining and explaining contexts that provide the explanations for 
varying parameters at the individual level. Needless to say, identification issues 
for such latent class models would be quite formidable. The likelihood func- 
tions would be complex, and the robustness of results would be uncertain. 

In this context, it is helpful to recall the still-germane observations made by 
Blalock in his 1984 review of contextual analysis. The ultimate goal of multi- 
level analysis is to determine how social context affects and is affected by 
social behavior. Improved answers to this question require parallel advances 
in theory and data collection as well as in statistical methodology. While there 
is plenty of room for improvement in all of these domains, the rate of progress 
is encouraging.

See Dickens & Lang (1988) for an application of an endogenous switching regressions model
with latent regimes to the problem of employing dual labor markets as a context without an a priori
definition of labor market boundaries.


Abstract


This paper discusses five themes related to the formation and functioning of
stepfamilies. The first section examines how demographic trends, particularly 
changing marriage and remarriage rates and rising levels of cohabitation, are 
affecting post-marital family arrangements. The second section looks at the 
creation of new forms of kinship associated with remarriage and cohabitation. 
The third and fourth parts of the paper explore issues related to the social 
organization of stepfamilies and the consequences for children. The final 
section of the paper takes up the question of whether and why remarried 
persons are at higher risk of divorce. We discuss some implications of research 
for public policy and current theoretical debates about the status of the Amer- 
ican family. 


INTRODUCTION 


In the late 1970s, when rates of divorce and remarriage were at historical high 
points in the United States, each of us separately reviewed the surprisingly 
small number of studies on stepfamily life. In our reviews, we presented an 
agenda of ideas that guided our individual and collaborative investigations 
over the next decade and a half. Cherlin’s review noted the striking
absence of well-defined rules for family life among households formed by
remarriage (Cherlin 1978). He hypothesized that higher rates of separation and
divorce might be linked to the structural anomalies of this “incompletely
institutionalized” family form. Furstenberg observed that the process of
“recycling the family” replaced the nuclear family with a distinctively different
family form (Furstenberg 1979, Furstenberg & Spanier 1984). He was espe- 
cially concerned with the ways that remarried couples thought differently about 
marriage and family life after divorce and the implications of remarriage for 
the workings of the American kinship system. 

In this article, our intention is not to summarize all of the recent literature 
that has been produced since our early reviews. Several recent books and 
articles provide excellent introductions to the current research (Pasley & Ihin- 
ger-Tallman 1987, Hetherington & Arasteh 1988, Beer 1988, Coleman & 
Ganong 1990). Our objective is more modest. We want to revisit some of the 
topics that we raised in our earlier papers and books; and we wish to examine 
subsequent research on these topics. For convenience, we have grouped our 
comments into five sections: 


1. The changing demography of divorce and remarriage. 

2. The construction of new forms of kinship among members of stepfamilies. 

3. Studies of stepfamily "process": the daily interactions through which indi- 
viduals build a stepfamily life. 

4. The effects on children of living in a stepfamily. 

5. The risk of divorce among remarried persons. 


We believe that remarriage and stepfamily life constitute a strategic site for 
research on the family. The daily dilemmas of creating and conducting family 
life after divorce lead one to ask: what is a family, what are the obligations of 
parenthood, and how are bonds of kinship formed and maintained? There are, 
of course, no definitive answers to these questions; but our task as social 
scientists is to discover how answers are being formulated by those who
experience divorce and remarriage and by those who devise policies and 
practices for dealing with the consequences of family change. 


THE CHANGING DEMOGRAPHY OF DIVORCE AND 
REMARRIAGE 


Among developed nations, the United States has unusually high rates of di- 
vorce and remarriage. Close to a third of all Americans will marry, divorce, 
and remarry. But informal, cohabiting unions also are increasingly common. 
The substitution of cohabitation for marriage makes it more difficult to follow 
remarriage trends and to describe stepfamily life. This trend toward the
deinstitutionalization of marriage may be partly a response to the high risks of divorce
and the complexities of stepfamily life. 

Until the 1960s, the remarriage rate and the divorce rate in the United States 
rose and fell in parallel: when the divorce rate increased, so did the rate of
remarriage. It seemed that divorced people were not rejecting the ideal of being
married; they were just rejecting their own first marriages and trying again.
Starting in the 1960s, however, the annual rate of remarriage (the number of 
remarriages in a given year divided by the number of previously married 
persons age 15 and older in the population) fell even though the divorce rate 
began to rise. The fall has continued ever since. At current rates, only about
two thirds of separated and divorced women would ever remarry, compared
to three fourths in the 1960s; the corresponding figure for men is about three
fourths compared to more than four fifths in the 1960s.1

There are distinct variations in the propensity to remarry among subgroups 
of Americans. These variations reflect both choice and opportunity. For ex- 
ample, one study found that women who marry in their teenage years or early 
twenties are more likely to remarry than are women who married later; the 
authors speculate that women who married young may have less experience 
in, and less of a preference for, living independently. Among women who
choose to remarry, some may have more difficulty finding suitable spouses 
than others. Women who have three or more children, for instance, have a 
lower likelihood of remarrying, probably because they have a harder time 
finding a desirable partner willing to share the responsibilities of supporting 
a large family. And older women face a shrinking marriage market because 
of most men's preference for younger partners. Consequently, women who 
divorce at younger ages are more likely to remarry than are those who divorce 
at older ages. 

In addition, remarriage is far more likely among non-Hispanic whites than 
among Hispanics or African-Americans. According to one estimate from 1980 
US Census Bureau data, about half of all non-Hispanic white women will 
remarry within five years of their separation, compared to one third of Mexi- 
can-American women, and one fifth of African-American women (Sweet & 
Bumpass 1987). These differences occur, in part, because remarriage rates are 
lower for the poor than for the non-poor. With few assets and little property 


1Based on 1985 data, Larry L. Bumpass, James Sweet, and Teresa Castro Martin (1990) estimated
that 72% of recently separated women would remarry. But remarriage rates have declined further
since then. A 1992 Census Bureau report suggests that the true figure may be closer to two thirds;
see US Bureau of the Census (1992c). The most recent estimate for men (78% remarrying within
10 years) is from the 1980 Census data and is probably too high now. See James A. Sweet and
Larry L. Bumpass (1987).

2All of the findings in this paragraph are from Larry Bumpass, James Sweet, and Teresa Castro
Martin (1990).

to pass on to children, people with low incomes have less need for the legal
protection marriage brings. Yet lower incomes do not account for the entire 
difference. The low remarriage rates for African-Americans also are consistent 
with the lesser place of marriage in the African-American family. 


Cohabitation 


The overall decline in remarriage, however, is deceptive. As remarriage rates 
have declined, cohabitation among the formerly married has increased. Figure 
1 shows the change. It is based on data from the National Survey of Families 
and Households, conducted in 1987 and 1988. Individuals were asked to recall 
their personal history of union formation and dissolution. The bars on the left 
hand side represent the situation in approximately 1970, according to the 
retrospective histories given by the respondents. Forty-nine percent of persons 
had remarried within five years after they separated from their spouses. In 
addition, some had cohabited with a partner without marrying. When those 
who cohabited without remarrying are added to those who remarried, the sum 
is the number who were ever in a union—marital or cohabiting—within five 
years of separating. As can be seen, 58% had ever been in a union in 1970. 
By 1984, only 42% had remarried within five years of separating, reflecting
the drop in the remarriage rate. But the percentage who had ever been in a 
union had increased from its 1970 level to 62% in 1984. In the interim, 
cohabitation had become so widespread among the previously married that its 
increase had more than compensated for the decrease in remarriage (Bumpass 
et al 1991). 
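
The arithmetic behind this comparison can be laid out explicitly. The sketch below uses only the four percentages reported in the text; the variable and function names are ours, and the share who cohabited without remarrying is derived by subtraction rather than read from Figure 1.

```python
# Percentages within five years of separation, as reported in the text.
UNION_WITHIN_5_YEARS = {
    1970: {"remarried": 49, "ever_in_union": 58},
    1984: {"remarried": 42, "ever_in_union": 62},
}


def cohabited_only(year):
    """Percent who cohabited without remarrying: ever in a union minus remarried."""
    row = UNION_WITHIN_5_YEARS[year]
    return row["ever_in_union"] - row["remarried"]


remarriage_change = (UNION_WITHIN_5_YEARS[1984]["remarried"]
                     - UNION_WITHIN_5_YEARS[1970]["remarried"])
cohabitation_change = cohabited_only(1984) - cohabited_only(1970)

for year in (1970, 1984):
    print(year, "cohabited without remarrying:", cohabited_only(year), "%")
print("change in remarriage:", remarriage_change, "points")
print("change in cohabitation only:", cohabitation_change, "points")
```

The derived rise in cohabitation without remarriage (from 9% to 20%) exceeds the 7-point fall in remarriage, which is why the percentage ever in a union rose from 58% to 62%.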

Divorced persons, in other words, have not reduced their propensity to live 
with someone; rather, they have substituted cohabitation for remarriage. We 
interpret this pattern as an indication that remarriage, like first marriage, is 
becoming less obligatory and socially regulated. It follows that informal unions 
are generally less stable and secure arrangements. About one out of seven 
people who eventually remarry live with a different partner between marriages 
(Bumpass & Sweet 1989). Indeed, the provisional nature of informal unions 
may be part of their appeal for individuals who may be hesitant, at least 
temporarily, to recommit to formal marriage. So the velocity of transitions into 
and out of unions has surely increased since cohabitation became more widely 
acceptable in the United States. Accordingly, official marriage statistics are 
rapidly becoming unreliable indicators of patterns of family formation and 
reconstitution. 

To the extent that we are witnessing a trend toward the deinstitutionalization 
of remarriage, the rules and roles of family life are probably becoming more 
discretionary and less uniform. Much of what we discuss in later sections of 
this paper follows from the premise that for a large segment of the United 
States population, family life is becoming less predictable, more varied, and
more complex. This trend has important implications for both the social
organization of family life and the way that family members interpret and enact
kinship obligations, the topic of our next section. Later on we mention how
the growing precariousness of postmarital arrangements may be affecting
children’s development and well-being.

Figure 1 Percentage of maritally separated and divorced persons who remarried within 5 years,
and who were ever in a union (marital or cohabiting) within 5 years, in or about 1970 and in or about
1984. Source: Retrospective union histories provided by respondents in the National Survey of
Families and Households, 1987-1988, as reported in “The Role of Cohabitation in Declining Rates
of Marriage,” Larry L. Bumpass, James A. Sweet, and Andrew Cherlin, J. Marriage Fam.
53(1991):913-27.


NEW FORMS OF KINSHIP AND FAMILY 
ORGANIZATION 


One of the taken-for-granted aspects of family life in the West has been that 
the parents and children in the conjugal family will live in the same household 
until the children grow up. Until the last few decades, that assumption was 
justified. The increases, first, in divorce and remarriage and, more recently, in 
cohabitation and out-of-wedlock childbearing, have made this assumption 
problematic. Only 59% of American children lived with both biological parents
in 1992.3 Divorce splits the conjugal family into two households: one that
typically contains a custodial parent (usually the mother) and the children and
a second that contains the noncustodial parent (usually the father).

Figure 2 Kinship ties due to divorce and remarriage in a network, reported by Anne C. Bernstein,
“Unraveling the Tangles: Children's Understanding of Stepfamily Kinship,” pp. 83-111. In Relative
Strangers: Studies of Stepfamily Processes, ed. William R. Beer. Totowa, NJ: Rowman & Littlefield.
[Diagram of households 1, 2, and 3; legend: female, male, marriage, divorce.]

Remarriage can bring a multitude of ties across households, creating what 
one of us has called “the new extended family” (Furstenberg 1987). In 1992,
11.2% of all children were living with one biological parent and one stepparent.
Consider one set of family ties studied by Anne Bernstein (1988) and diagrammed
in Figure 2. This set of family ties is centered on the marriage of
Carin and Josh, who reside in household 2. They have a mutual child, Alice. 
Josh previously was married to Peggy, with whom he had two children, Janet 
and Tim, who live with Peggy in household 3. Carin previously was married 
to Don, with whom she had two children, Scott and Bruce, who still live with 
her. Her former husband Don then remarried Anna and had two more children, 


3We calculated this figure (and the 11.2% figure in the next paragraph) based on information 
from the June 1990 Census on children living with neither parent, as reported in US Bureau of the 
Census (1992a); information on the biological versus nonbiological status of parents of children 
who were living with two parents, in the June 1990 Current Population Survey (US Bureau of the 
Census 1992c); and the distribution of living arrangements of all children, in the March 1992 Current 
Population Survey (US Bureau of the Census 1992b).

Ethan and Ellen, who live with Anna and him in household 1. Here is how
Bruce described his family: 


Tim and Janet are my stepbrother and sister. Josh is my stepdad. Carin and Don
are my real parents, who are divorced. And Don married Anna and together they 
had Ethan and Ellen, my half-sister and -brother. And Carin married Josh and 
had little Alice, my half-sister (Bernstein 1988, p. 101). 


How are we to make sense of this admixture? How many families are 
involved? What are their boundaries? The relationships spill over the sides of 
households, with children providing the links from one to the next. Let us 
suggest that there are two ways to define families in this context. The first is 
to focus on a household, even though ties extend beyond it. The advantage of 
this strategy is that in developed Western societies, we are accustomed to 
thinking of immediate families as being contained within households. We 
define a stepfamily household as a household that contains a parent with 
children from a previous union and that parent's current partner. The children 
from the previous union are the stepchildren, and the current partner is the 
stepparent. The household can be even more complex: both partners may have 
children from previous unions and they also may have a new, mutual child 
from the current union. But the defining criterion is that they all reside in the 
same household. (Although there are three households in Figure 2, only house- 
hold 2 is a stepfamily household. Household 1 has no children from previous 
unions living there; and household 3 has children from a previous union but 
no current spouse or partner.) In fact, there is little research comparing the 
similarities and differences between these expanded households and their 
simpler nuclear counterparts regarding the exercise of authority, the allocation 
of time and resources, the bonding of members, routines and rituals, and the 
like (Hetherington & Jodl 1993).
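
The household-based definition can be applied mechanically to the three households described above. The sketch below is only an illustration: the data structures and function names are hypothetical, and it assumes that two co-resident adults are current partners and that a child is "from a previous union" when exactly one of those adults is the child's biological parent.

```python
# Biological parents of each child, as described in the text.
PARENTS = {
    "Ethan": {"Don", "Anna"}, "Ellen": {"Don", "Anna"},
    "Scott": {"Carin", "Don"}, "Bruce": {"Carin", "Don"},
    "Alice": {"Carin", "Josh"},
    "Janet": {"Josh", "Peggy"}, "Tim": {"Josh", "Peggy"},
}

# Who lives where, as described in the text.
HOUSEHOLDS = {
    1: {"adults": {"Don", "Anna"}, "children": {"Ethan", "Ellen"}},
    2: {"adults": {"Carin", "Josh"}, "children": {"Scott", "Bruce", "Alice"}},
    3: {"adults": {"Peggy"}, "children": {"Janet", "Tim"}},
}


def stepchildren(household):
    """Resident children with exactly one biological parent among two resident partners."""
    adults = household["adults"]
    if len(adults) < 2:  # no current partner in the household
        return set()
    return {c for c in household["children"] if len(PARENTS[c] & adults) == 1}


def is_stepfamily_household(household):
    return bool(stepchildren(household))


for number, household in HOUSEHOLDS.items():
    print(number, is_stepfamily_household(household), sorted(stepchildren(household)))
```

Under these assumptions only household 2 qualifies, with Scott and Bruce as the stepchildren and Josh as the stepparent, which agrees with the classification given in the text.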

Note that we have not restricted our purview to families where biological 
and nonbiological parents are currently married, or even previously married. 
We want to include the large and growing number of stepfamilies that are 
formed when previously-married people cohabit, or when an unmarried woman 
bears a child and later moves in with a man who is not the biological father. 
This latter kind of stepfamily, in which there was no prior marriage but rather 
an out-of-wedlock birth, has received little attention, although as we have 
already observed, it has become common (Beer 1988). 

The second way to define families is to ignore household boundaries and 
to focus instead on the chains that extend from one household to another. The 
links of the chains are children from previous unions. They connect a divorced 
woman and her new partner with her ex-husband and his new partner.
Anthropologist Paul Bohannan (1971) called these pathways “divorce chains.” But it
is probably more accurate to refer to them as remarriage chains, as illustrated
by the complicated network of ties among the three households depicted in
Figure 2.

In fact, if you asked any of the members of the households in Figure 2 who 
was in their family, you likely would get a different answer from almost every 
one of them. To be sure, full siblings such as Bruce and Scott probably would 
name the same chain. Otherwise, each person has his or her own distinctive 
chain. Consequently, the only way to specify the members of a remarriage 
chain is to define it in reference to a particular person—not a household. 
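
The person-centered character of these networks can be illustrated with the ties shown in Figure 2. The sketch below is not a definition of a remarriage chain; it uses hypothetical rules of our own (a stepparent is the current spouse of a biological parent, a half sibling shares exactly one parent, a stepsibling is a stepparent's child who shares no parent) simply to show that the resulting set of kin depends on whose point of view is taken.

```python
# Biological parents and current marriages, as described in the text.
PARENTS = {
    "Ethan": {"Don", "Anna"}, "Ellen": {"Don", "Anna"},
    "Scott": {"Carin", "Don"}, "Bruce": {"Carin", "Don"},
    "Alice": {"Carin", "Josh"},
    "Janet": {"Josh", "Peggy"}, "Tim": {"Josh", "Peggy"},
}
SPOUSES = {"Don": "Anna", "Anna": "Don", "Carin": "Josh", "Josh": "Carin"}


def kin_of(child):
    """Classify the kin of one child under the hypothetical rules stated above."""
    own_parents = PARENTS[child]
    stepparents = {SPOUSES[p] for p in own_parents if p in SPOUSES} - own_parents
    kin = {"parent": set(own_parents), "stepparent": stepparents,
           "full sibling": set(), "half sibling": set(), "stepsibling": set()}
    for other, other_parents in PARENTS.items():
        if other == child:
            continue
        shared = len(own_parents & other_parents)
        if shared == 2:
            kin["full sibling"].add(other)
        elif shared == 1:
            kin["half sibling"].add(other)
        elif other_parents & stepparents:
            kin["stepsibling"].add(other)
    return kin


def named_kin(child):
    """Everyone the child would list under these rules, regardless of label."""
    return set().union(*kin_of(child).values())


for child in ("Bruce", "Tim"):
    print(child, {label: sorted(people) for label, people in kin_of(child).items()})
```

For Bruce these rules reproduce the labels in his own description (Tim and Janet as stepsiblings; Ethan, Ellen, and Alice as half siblings). Scott's list covers the same people as Bruce's, whereas Tim's list omits Don, Anna, Ethan, and Ellen.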

Remarriage chains can serve as support and exchange networks, when 
ex-spouses and new spouses give and request favors (Keshet 1988). This 
system depends on the quality of relations and cooperation of all involved and 
can easily be disrupted. In extended kin networks centered on conjugal families 
or on "blood" relatives (see below), people might extend favors based on 
goodwill and a sense of shared purpose. In remarriage chains that sense of 
goodwill is limited, and decisions are more likely to be made using the calculus 
of self-interest. It is best thought of as a system with weak rather than strong 
ties, a distinction between systems of support made by Granovetter (1973). 
Remarriage chains are more likely to provide information and referrals than 
direct material and emotional assistance (Furstenberg 1981, Johnson 1988, 
White 1993). 

So we might expect to discover far more variability and instability in the 
kinship networks of stepfamilies than is typical in nuclear households. And it 
is largely unknown how these networks operate when the quasi-stepparents in 
cohabiting units are involved. Do family alliances become even more im- 
provised and unpredictable when the roles of members are less clearly speci- 
fied? In family systems where membership is fluid, a sense of kinship can be 
invoked easily; but a sense of kinship also can be withdrawn easily. We return 
to this point when we discuss the consequences of divorce and remarriage later 
in the paper. 


Doing the Work of Kinship 


When Americans think about kinship, they tend to think about people related 
through either "blood" or marriage (Schneider 1980). Mere existence of a 
blood tie, however, does not necessarily make two people think of each other 
as kin. Kinship is achieved by establishing a “relationship,” seeing each other 
regularly, corresponding, giving or receiving help—that is, by making repeated 
connections. The absence of a relationship may mean that even a blood relative 
may not be counted as family. To be sure, almost everyone considers their 
parents and their children to be kin even if they have not seen them in a long 
time. But Americans would understand what someone meant if she said, “My 
father left home when I was three and I never saw him again; he’s not part of
my family. My stepfather is really the person I consider to be my father.” And
similarly a person might not regard a cousin as a relative if he never met her. 

To be a relative, you must do the work of creating and maintaining kinship. 
Among parents and children, this happens almost automatically—so much so 
that we rarely think about it. But among stepparents and stepchildren it does 
not happen automatically. For one thing, a stepparent in a remarriage that has 
followed a divorce does not replace the stepchild’s nonresident parent, as was 
the case when most remarriages followed a death. [See also Johnson’s (1988) 
study of the indicators of kinship ties following divorce.] Rather, the stepparent 
adds to the stepchild’s stock of potential kin. If both biological parents are still 
involved in the stepchild’s life, it is not clear what role the stepparent is 
supposed to play. There are few guidelines, few norms. This situation is what 
led one of us to conclude that the role of the stepparent is incompletely 
institutionalized (Cherlin 1978). 

In fact, there is great variability in how stepparents and stepchildren view 
each other. In a 1981 national survey, children ages 11 to 16 and their parents 
both were asked who they specifically included in their family. Although only 
1% of parents did not mention their biological children, 15% of those with 
stepchildren in the household did not mention them. Among children, 31% of 
those in stepfamily households omitted the stepparent who was living with 
them; and 41% failed to mention a stepsibling (Furstenberg 1987). 

What, then, determines how stepchildren and stepparents view each other? 
A key factor is how old the child was when the stepparent joined the household: 
the younger the child, the more likely he or she is to consider the stepparent 
to be a “real” parent (Marsiglio 1992). The evidence isn’t precise enough to 
establish an age cut-off for emotional bonding. Still, we suspect that if the 
stepparent arrives during the pre-school years (before the child is five), it is 
possible to establish a parent-like relationship; but if the stepparent arrives 
much later, strong bonds form much more rarely. Research shows that children 
establish strong bonds of attachment to their parents, whom they rely on for 
security, within their first year or two. Children’s attachments become some- 
what reduced after the pre-school years (Parkes & Hinde 1982). A second 
factor is how frequently the stepchild sees his or her nonresident parent; the 
less frequent their interaction, the easier it is for a stepparent to take on a 
parent-like role (Marsiglio 1992). A third factor is the quality of the relationship 
between the stepparent and the biological parent in the home. The more 
satisfactory that relationship, the more authority the stepparent has to take on 
a parental role. Finally, there is probably individual variation depending on 
the child’s temperament. Some children may be more welcoming to new 
parents than others. So there may be differences in the quality of relations 
among children in the same family. 

The kinship terms used by children to designate their parents are one measure
of the bonding process. There is no agreed-upon direct term of reference for a
stepparent (Cherlin 1978). For example, few if any children call their stepfather
“Step-Dad,” and it would be equally rare for children to refer to him formally as
“Mr. Jones.” Instead, some will call him “Dad,” but many will use the stepparent's
first name. In the absence of empirical research that relates the use of terminology
to bonding, we are inclined to think that the use of the first name suggests a 
relationship that is neither parent nor stranger, but somewhere in between. If 
children do not address stepparents using the parental term for the biological 
parent, then children may not grant stepparents the reciprocal rights and obliga- 
tions ordinarily accorded to so-called "real" parents. 

Residential and nonresidential biological parents often have a strong interest 
in the use of names (Furstenberg & Spanier 1984). A custodial mother may 
urge her children to call her new husband "dad" in order to replace the role 
of the noncustodial father. And the latter, in turn, may contest this effort to 
supplant him with a surrogate. Some members of a group of inner-city African 
Americans interviewed by one of us (Furstenberg) made a distinction between 
“Fathers” and “Daddies”: the men who propagate the children and the men 
who care for them (Furstenberg et al 1992). "Fathers" are the biological parents 
of the child, but “Daddies” are the men who assume responsibility for their 
children—who play the role of father to them. The group of African Americans 
hardly ever talked about having a stepfather. It may be that among African 
Americans, fatherhood is earned rather than accorded by birth or marriage. As 
we argue later on, this sort of cultural reasoning may actually be spreading 
more broadly in the United States as union instability increases. There is even 
greater variability in how more distant stepkin relate to the stepchildren. When 
the two of us carried out a national study of grandparents, we asked them about 
relationships with stepgrandchildren. Once again, the younger that children 
were when their parents remarried, the more that grandparents reported feeling 
that the children were “like biological grandchildren." One stepgrandmother, 
who had not acquired her stepgrandchildren until they were teenagers, was 
asked what they called her: 


Harriett. I insisted on that. They started by calling me Mrs. Scott .... But from 
the beginning, you realize, these children were in their teens, and it was hard to 
accept somebody from an entirely different family, and they didn't know me from 
Adam .... Now if they were smaller—you know, younger—it would have made 
a difference (Cherlin & Furstenberg 1992, p. 158). 


It also made a big difference where the stepgrandchildren were living: for 
example, it made a difference whether they resided with the grandparents' 
adult children (as when a son married a woman who had custody of children 
from a previous marriage) or were living most of the time in another household 
(as when a daughter married a man whose children lived with his former wife 
except for every other weekend and a month in the summer). Within these 
constraints, the closeness of the relationship depended on how much effort the 
stepgrandparents and their adult children put into creating the relationship. 
Being a steprelative depends on doing the work of kinship. 

In other words, remarriage is making parenthood and kinship an achieved 
status rather than an ascribed status, to use the classic distinction in anthro- 
pology and sociology (Davis 1948, Eisenstadt 1966). Traditionally being a 
father or a mother had been a status ascribed to individuals at the birth of their 
child, which generally occurred only after the couple married. To be sure, 
people marry and have children through their own efforts; nevertheless, one 
did not have to do anything else to be a parent, nor could one easily resign 
from the job, especially in a family system that strongly discouraged divorce. 
In this sort of stable family system, being a grandparent was similarly ascribed. 
Those rules still apply to the majority of children who are born to two married 
parents. 

Remarriage after divorce, though, adds a number of other, potential kinship 
positions. Whether these positions are filled depends on the actions of the 
individuals involved. The most obvious positions are stepfather and step- 
mother. We have discussed the wide variation in the roles that stepparents 
play. Some are parent-like figures who are intensely involved with their step- 
children. Many others are more like friends or uncles and aunts. Others, 
particularly stepparents who don’t live with their stepchildren every day, may 
be like distant cousins—available for a kinship relation but, in fact, rarely 
assuming an important position in the child’s network. In all cases, how much 
like a family member a stepparent becomes depends directly on his or her 
efforts to develop a close relationship with stepchildren. Kinship relations in 
stepfamilies belong to the broader category of in-law relationships—ties cre- 
ated by marriage or marriage-like arrangements. Such ties are characteristically 
discretionary and even more so in the absence of marriage. In fact, in France, 
the current term for stepparent, beau-parent, also means parent-in-law. 

Intergenerational ties to stepgrandparents are even more voluntary; they 
range from no contact to a kin-like role, depending in large part on the 
investment the stepgrandparents make. Our research on grandparents, in fact, 
reveals considerable variation in the role played by stepgrandparents in chil- 
dren’s lives. To a large extent, remarriage restores a measure of balance 
between the maternal and paternal lines in the networks of kinship. However, 
there is reason to suspect that unless remarriage occurs early in the child’s life, 
kinship exchanges over the long-term are strongly tilted toward the (custodial) 
mother’s side of the family. The diminishing importance of legal marriage 
may accentuate this trend. It seems likely to us that cohabitation will undermine 
the sense of obligation to extended family that is created by matrimony. For 
example, the quasi-in-laws of cohabiting couples are even less likely than the 
in-laws created by a remarriage to form and maintain enduring bonds. But, as 
we noted before, research on the consequences of cohabitation is scarce. 


STEPFAMILY "PROCESS": BUILDING A STEPFAMILY 


After divorce, single parents and their children establish, often with some 
difficulty, agreed-upon rules and new daily schedules. They establish ways of 
relating to each other that may differ from the pre-disruption days. A daughter 
may become a special confidante to her mother. A son may take out the 
garbage, wash the car, and perform other tasks his father used to do. Put another 
way, single parents and children create a new family system. Then, into that 
system, with its shared history, intensive relationships, and agreed-upon roles, 
walks a stepparent. It can be difficult for the members of the stepfamily 
household to adjust to his or her presence. 

Recent research suggests that the adjustment can take years to complete. 
One family therapist argues that the average stepfamily takes about seven years 
to finish the process (Papernow 1988). That is a long time, considering that 
more than one fourth of all remarriages disrupt within five years (Martin & 
Bumpass 1989). At the start, the stepparent is an outsider, almost an intruder 
in the system. At first, the stepparent may view himself naively as a healer 
who will nurse the wounded family back to health (Papernow 1988). But his 
initial efforts may hurt rather than help him attain his goal: a stepdaughter may 
resent the intimacy and support a new stepfather provides to her mother; a son 
may not wish to relinquish washing the car to a well-meaning stepfather who 
thinks he is just doing what fathers are supposed to do. As the two of us wrote, 
“stepparents quickly discover that they have been issued only a limited license 
to parent." The wiser ones among them accept the limits of their job description 
and wait for their time to arrive (Furstenberg & Cherlin 1991, p. 85). 

According to recent articles, family therapists seem to agree that for a 
stepfamily household to be successful, the remarried couple must build a 
boundary around themselves and work together to solve problems. This process 
is made more complicated by the negative images of stepparents in the larger 
culture (Ganong et al 1990) and their weak status in our legal system (Fine & 
Fine 1992). Their own marriage, rather than the relationship between the 
biological parent and the child, must become the dominant sub-system within 
the stepfamily household (Keshet 1988, Papernow 1988, Spanier & Fursten- 
berg 1987). To do so, they must reserve time for each other, even if that means 
sometimes deferring the demands of others. The task of the remarried couple 
is to create a shared conception of how their family is to manage its daily 
business. They cannot rely on generally accepted norms, as adults in first 
marriages can, because few norms exist. They must draw the blueprints them- 
selves and then construct the family. And, depending on their age, children 
must be brought into taking an active part in the construction of a new family 
(Hetherington & Jodl 1993). 

During this process of family building, research suggests, it can be harder 
to be a stepmother than a stepfather (White 1993). In the typical remarriage 
chain, the children live with their biological mother and a stepfather; they 
visit their biological father and his new wife, who is their stepmother. 
Consequently, the typical stepmother does not live with her stepchildren; 
rather, she must establish a relationship during the visits. She usually is 
seeing children whose primary tie is to their biological mother, with whom 
she must compete. In contrast, stepfathers compete with noncustodial fathers, 
many of whom see little of their children (Seltzer 1991). Moreover, in the 
minority of cases in which the children live with the stepmother and the 
biological father, other difficulties can arise. In these atypical cases, the 
children may have been subject to a custody battle, or they may have been 
sent to live with the father because the mother could not control their 
behavior (Ihinger-Tallman & Pasley 1987). And mothers who are noncusto- 
dial parents visit their children and telephone them more often than do 
noncustodial fathers, creating competition with the stepmother (Furstenberg 
& Nord 1985, White 1993). 

Stepfathers, in other words, often can fill a vacuum left by the departed 
biological father. Stepmothers, in contrast, must inhabit the space already 
occupied by the biological mother. Moreover, stepmothers may judge them- 
selves according to the culturally dominant view that mothers should play the 
major role in rearing children; if so, they may fall short of these high standards. 
Stepfathers, in contrast, may hold themselves to the lower standard, namely, 
that fathers are supposed to provide support to the mother but let her do most 
of the hands-on childrearing. If so, they may feel satisfied with their role 
performance, even if they are doing less than many dissatisfied stepmothers 
(Keshet 1988, White 1993). 

One critical area of family building that has not received much attention is 
the merging of economic systems. Of course, nuclear families must integrate 
economic resources and devise a system of allocating funds. Still, it is obvious 
that divorce complicates this process. Often, the chains of connections that we 
referred to in Figure 2 signify not only emotional connections but also eco- 
nomic claims. The unemployment of a former partner has consequences for 
the household economy of his or her current spouse. Also, there are the 
inevitable problems of dividing the expenses of childrearing between current 
and former partners. These economic dilemmas extend to kin outside the 
family. Does divorce obliterate economic exchange among extended kin? Does 
remarriage automatically require it? These questions become even more murky 
when couples cohabit instead of remarrying. How do the quasi-stepparents in 
cohabiting unions recognize their obligations to their partner’s children or to 
the relatives of their partner, their quasi-in-laws? Hardly any research has been 
done on these topics. 

We do not want to leave the impression that stepfamily life in the United 
States is an interminable struggle. Most stepparents report that they are happy 
with their roles and their new families. In the 1981 survey, a large majority 
of parents and children in stepfamily households rated their households as 
“relaxed,” “orderly,” and "close"; and less than one third found them “tense” 
or "disorganized." Nevertheless, the ratings of persons in stepfamily house- 
holds were slightly but consistently less positive than the ratings of persons in 
first marriage households (Furstenberg 1987). Moreover, the studies suggest 
that there is a wide variation in the roles stepparents play (White 1993). In a 
1987-1988 national survey, half of all stepfathers disagreed with the statement, 
“A stepparent is more like a friend than a parent to stepchildren”; one third 
agreed; and the rest were neutral (Marsiglio 1992). Often, those stepparents 
who manage to integrate themselves into the stepfamily household successfully 
play a role somewhere between that of a parent and of a trusted friend—what 
Papernow calls an “intimate outsider” (Papernow 1988). 


EFFECTS ON CHILDREN 


Fifteen years ago, the two of us thought that remarriage would improve the 
overall well-being of children whose parents had divorced. For one thing, when 
a single mother remarries, her household income usually rises dramatically 
because men's wages are so much higher, on average, than are women's wages. 
One national study found that 8% of children in mother-stepfather households 
were living below the poverty line, compared to 49% of children in single-mother 
households (Bachrach 1983). Consequently, if a divorce causes a 
decline in household income that hurts the well-being of children, then an 
increase in household income after the mother remarries should improve chil- 
dren's well-being. In addition, the stepparent adds a second adult to the home. 
He or she can provide support to the custodial parent and reinforce the custodial 
parent’s monitoring and control of the children's behavior. A stepparent also 
can provide an adult role model for a child of the same gender. 

Despite these advantages, many studies now show that the well-being of 
children in stepfamily households is no better, on average, than the well-being 
of children in divorced, single-parent households. Both groups of children 
show lower levels of well-being than do children in two-biological-parent 
families. For example, psychologists Mavis Hetherington, Glenn Clingempeel, 
and several collaborators studied about 200 white households, divided into 
three groups: nondivorced two-parent households; divorced, single-mother 
households in which the mothers had been divorced for about four years, on 
average; and stepfamily households that had just formed (four months average 
duration) and in which the wife was the biological parent and the husband was 
the stepparent. The sample was not selected randomly but rather recruited by 
such means as advertisements, examining marriage records, and sending no- 
tices to community organizations. All households had at least one child be- 
tween nine and thirteen years old; these early adolescents were the main focus 
of the study. Households were evaluated using multiple methods, including 
personal interviews with the parents and children, standardized tests given to 
the children, and videotaped family problem-solving sessions. Evaluations 
were conducted three times: at the start of the study, again about a year later, 
and yet again another nine months later (Hetherington & Clingempeel 1992). 

At all three evaluations, the children from both the single-mother and re- 
married households were not faring as well as the children in the nondivorced 
households. For example, all the mothers were asked which items on a list of 
behavior problems applied to their early-adolescent child. Scores above a 
certain level on this widely used behavior problems checklist are said to 
indicate serious difficulties that might warrant the assistance of mental health 
professionals. Even at the last assessment, about 25 to 30% of the children in 
the single-mother households and the stepfamily households were above this 
level, as opposed to 10% or less of the children in nondivorced households. 
There was little difference between the former two groups (Maccoby 1992). 

A national health survey of 15,000 children in 1981 produced similar results. 
Children in stepfamily households and in single-parent households both re- 
ceived higher average scores on a checklist of behavior problems than did 
children in nondivorced, two-parent households. When parents were asked 
questions about the need for psychological help for their children, 3% of 
nondivorced parents said that their child needed help or had received help in 
the previous year, compared to 10% in single-mother households and 10% in 
mother-stepfather households. Both of the latter groups had children who were 
more likely to have repeated a grade in school than did children from non- 
divorced households. On all of these indicators, there was little difference 
between the children in single-parent households and in stepfamily households 
(Zill 1988). 

There is conflicting evidence as to whether children of different ages or 
genders adjust differently to the arrival of a stepparent. Several studies, con- 
ducted mostly with younger children, have found that girls had a more difficult 
time adjusting to the presence of a stepfather than boys did adjusting to a 
stepmother (Bray 1988, Hetherington 1987). Some of the authors speculated 
that girls tend to form close bonds to their divorced mothers and that these 
bonds are disrupted by the arrival of a stepfather. In support of this idea, at 
least two studies found that daughters showed poorer adjustment when their 
mothers and stepfathers reported greater cohesion and bonding in their mar- 
riage; conversely, they showed better adjustment when there was less cohesion 
in the marriage (Brand et al 1988, Bray 1988). It is as if the daughters’ sense 
of well-being falls at least temporarily when their mothers turn some attention 
and affection toward their new husbands. 

However, the 1981 national health survey showed few differences by either 
age or gender. Neither did Hetherington & Clingempeel's recent joint study 
find differences between the early-adolescent girls and boys in their study, 
despite the expectations of the authors, each of whom had reported gender 
differences among pre-adolescent children in prior studies. Instead, Hether- 
ington & Clingempeel now speculate that it is difficult for early adolescents 
of either gender to adjust to a remarriage. This period is when children must 
come to terms with their own burgeoning sexuality; it may be disconcerting 
to have an adult sexual partner of the parent move into the house—especially 
one for whom the traditional incest taboos do not hold. Alternatively, the 
number of family transitions might impair the adjustment of children in step- 
families. Having coped with a divorce, and possibly with the introduction of 
a live-in partner, these children must now cope with another major change in 
their family system. Some studies have found a relationship between the 
number of family transitions a child has experienced, on the one hand, and 
behavior problems, on the other hand (Wu & Martinson 1993, Capaldi & 
Patterson 1991). Finally, children and parents with unmeasured personal char- 
acteristics that impair family cohesion could be disproportionately represented 
in the population of divorced and remarried families. No study to our knowl- 
edge has done an adequate job of examining how much of the effect of marriage 
is due to selection. 

Only one finding is well-established concerning the long-term effects on 
children of having lived in a stepfamily household. Children in stepfamily 
households—particularly girls—leave their households at an earlier age than 
do children in single-parent households or in two-parent households. They 
leave earlier to marry; and they also leave earlier to establish independent 
households prior to marrying. An analysis of a large, six-year, national study 
of high school students showed this pattern for girls (Goldscheider & Gold- 
scheider 1993). In a British study, 23-year-olds who had left their parental 
homes were asked the main reason why they left. Demographer Kathleen 
Kiernan (1992) reported that those who had lived in stepfamily households 
were substantially more likely to say that they left due to “friction at home” 
than were those who had not lived in stepfamily households. Again, the 
differences were greater for girls. An analysis of a 1987-1988 American 
national survey found that girls who had lived in a stepfamily household were 
more likely to have left home by age 19 to marry or to live independently than 
were girls who had lived with single parents or with two parents; the differences 
were much weaker for boys. If a girl also had lived with stepsiblings, her 
likelihood of leaving home by age 19 was even higher (Aquilino 1991). 

Footnote: In 33 of the states, it was legal in 1993 for a stepfather to divorce his wife and marry his wife's 
daughter. See Margaret Mahoney (1993) and Mary Ann Glendon (1989). 

Interviews in 1980 and 1983 with a national sample of currently married 
persons suggested that tensions between stepchildren and their parents and 
stepparents cause the early home-leaving. Those who had stepchildren in their 
households reported more family problems involving children. The authors 
hypothesize that one way these problems are resolved is by encouraging, or 
arranging for, the stepchildren to leave the household. During the three years 
between interviews, 51% of all the teenage stepchildren had left the house- 
holds, compared to 35% of all the teenage biological children. Some may have 
chosen to live with their other parent, some may have been forced to do so, 
and some may have left to go to school, establish their own residence, cohabit, 
or marry (White & Booth 1985). If this effect is indeed more pronounced for 
girls, it suggests that the "friction" in the household may be due to the disrup- 
tion of the mother-daughter bond or to the presence of the mother's male sexual 
partner, whose relationship to the daughter is ambiguous. 

Only a small amount of recent research on the consequences of children's 
experience in stepfamilies has looked at long-term effects. Even less of this 
literature has sorted out the complementary, compounding, and sometimes 
competing effects of divorce and remarriage/cohabitation. From evidence al- 
ready discussed, it seems likely that kinship bonds in stepfamilies are more 
fragile, less permanent, and not as significant. A thorough review of the 
literature by White (1993) provides a wealth of evidence showing that inter- 
generational ties between children and their stepparents are less robust than 
ties between children and biological parents. Nonetheless, a good deal of 
variation exists depending on conditions such as the length of time in stepfamil- 
ies, the quality of relations between the biological parents and their offspring, 
and the gender of the stepparent. In short, as White demonstrates, the effects 
of stepfamily life are far from uniform, and some portion of the presumed 
effect may be due to divorce rather than remarriage. 

It is important to recognize that some stepparents manage to build relations 
with their partner's children, though rarely so if their remarriage dissolves. 
Still, the odds of building durable and intimate bonds that resemble the strong 
ties that often occur among biological parents and children are relatively low. 
The discretionary quality of in-law relationships—especially relationships that 
have a legacy of conflict or emotional distance—often seems to dictate the 
kinship bonds in later life. 

It strikes us that the comparison of stepchildren and biological offspring is 
not always the most appropriate one. Perhaps it is more reasonable to contrast 
the sense of obligations that stepparents and their adult children experience 
toward one another to obligations assumed by sons and daughters-in-law. By 
that standard, we may find that relations between stepchildren and their parents 
are not so impoverished. It would also be interesting to compare the relatively 
small number of stepchildren who are adopted by their parents (who become 
treated like "blood" relatives) with adopted children whose membership in the 
family was not gained through divorce and remarriage. 

The structural comparisons of family types also need to be supplemented 
with qualitative research on the circumstances under which parents and chil- 
dren socially construct family bonds and how those constructions change over 
time. Clearly, some parents and children experience their steprelations as 
similar to ties with blood relatives. How do these individuals manage to ignore 
the signals of a culture that places strong values on blood ties? In the conclu- 
sion, we argue that such individuals may become more common in the future. 
The significance of blood bonds may wane to some degree if the institution 
of marriage continues to weaken. 


THE RISK OF RE-DIVORCE 


Throughout this commentary on the research literature about stepfamily life, 
we have repeatedly referred to the cultural, structural, and emotional complex- 
ities associated with family formation after divorce, particularly for couples 
with children. As mentioned in the introductory section, Cherlin's 1978 article 
suggested that the everyday problems associated with building and maintaining 
a stepfamily might generate marital strains, accounting for the higher risk of 
divorce among remarried persons. 

Recent studies confirm that remarriages are somewhat more likely to end 
in divorce than are first marriages. After ten years, about 37% of remarriages 
have dissolved, compared to about 30% of first marriages (Sweet & Bumpass 
1987). The difference is concentrated in the first several years, during which 
time people in remarriages have substantially higher rates of divorce than do 
people in first marriages. Cherlin's hypothesis has been the source of some 
debate among researchers. One of the first researchers to raise questions about 
the thesis that “incomplete institutionalization" of remarriage led to higher 
rates of divorce was Furstenberg, who, along with Graham Spanier, suggested 
an alternative explanation for the higher rates of divorce among remarriers. 
Remarried people, they argued, have ipso facto demonstrated a willingness to 
resort to divorce when unsatisfied in marriage; in contrast, many people in 
first marriages are reluctant to divorce regardless of their discontents with the 
relationship. Moreover, Furstenberg & Spanier argued that the experience of 
divorce may make individuals less inclined to remain in an unhappy remar- 
riage. So remarriage, they assert, involves a select group of people who may 
differ from first-marriers in their propensity to divorce (Furstenberg & Spanier 
1984). 

Research published during the 1980s provides evidence that supports both 
views. For example, support for the Furstenberg-Spanier selection hypothesis 
was presented by Teresa Castro Martin and Larry Bumpass. They found that 
a majority of remarried people had begun their first marriages as teenagers 
(Martin & Bumpass 1989). They speculate that a teenage first marriage may 
be a sign of personality characteristics or other difficulties that make it harder 
for a person to choose a good spouse or to maintain a marriage. (It is known 
that first marriages begun in the teenage years have a high divorce rate.) In 
any case, Martin & Bumpass found that the greater number of teenage first- 
marriers in the remarried population accounted for much of the higher divorce 
rate among the remarried. Another recent national study found support for both 
views: In a three-year longitudinal study, Alan Booth and John Edwards (1992) 
found that both the presence of stepchildren (a measure of stepfamily com- 
plexity) and a teenage first marriage (a measure of the selection of particular 
kinds of people into remarriage) accounted for portions of the higher divorce 
rate among the remarried. 

The debate that we helped to stimulate must now be reformulated to consider 
the growing number of couples who avoid marriage altogether. As we have 
repeatedly observed, the distinctions between first married, remarried, and 
never-married may be blurring as the link between marriage and childbearing 
becomes more tenuous and as more couples elect to cohabit before (or instead 
of) marrying again. These changes make it harder still to detect the sources of 
instability in patterns of family reconstitution. 


IMPLICATIONS FOR RESEARCH AND PUBLIC POLICY 


From the perspective of children, these demographic changes increase the odds 
of experiencing substantial family flux during childhood. A near-majority of 
children growing up in the United States today are likely to encounter multiple 
parent figures. Many must negotiate changing relations with these parent 
figures as the children or their parents move in and out of their households. The 
children will be situated in complex and changing kinship networks that 
involve the loss and acquisition of relationships. On average, children in 
stepfamilies and quasi-stepfamilies exhibit more problems than do 
children who grow up in nuclear families. Nevertheless, many children expe- 
riencing the divorce and remarriage of their parents appear to do well. 
Research on children whose family experiences include divorce and remar- 
riage is just beginning to identify some of the family processes that make a 
difference for children's long-term well-being. It seems likely to us that the 
same processes that make for successful development of children in two-bio- 
logical-parent families are likely to apply to children who grow up in stepfamil- 
ies. Some of these processes reside in the child, some in the parenting system, 
and some in the resources of the larger family system and community in which 
parent and child are embedded. The task for researchers is to map these specific 
sources of influence and to describe how they work in combination. This is a 
daunting challenge for developmentalists and family sociologists. 

The family changes that we have discussed in this paper represent part of 
a profound cultural transformation in the American kinship system. Elsewhere, 
separately and together, we have provided a fuller description of the sources 
and consequences of this transformation. Here we can only make a few ob- 
servations that are especially relevant to the concerns of researchers. 

During the past several years, Americans have witnessed on television and 
in the newspapers a steady stream of dramatic custody battles between bio- 
logical and nonbiological parents. Not all involve conflicts resulting from 
divorce and remarriage, but in nearly every case the same issue recurs: Do 
blood ties have legal primacy over bonds created by the emotional investment 
of a nonbiological parent? 

David Chambers, a legal scholar who has written about this issue, describes 
a court case that involved a boy named Danny, who was raised by his stepfather 
from the age of one, after his mother died. His biological father had not asked 
for custody initially. However, when Danny was seven, his older brother 
decided to live with the biological father. The father then sued for custody of 
Danny on the grounds that siblings should not be separated. A lower court 
ruled that Danny should be allowed to stay with his stepfather, who had been 
the primary parent for six of the seven years of Danny's life. But a higher 
court overruled this decision, referring to the stepparent as merely a "third 
party" who should not be allowed to interfere with the rightful interest of the 
biological parent. 

Concerning the dismissive treatment of stepparents by the courts, Chambers 
wrote that the ruling in Danny's case almost certainly exposes "society's 
conflicting and unresolved attitudes about stepparents." In fact, the legal doc- 
trine that persists can be traced back to a longstanding belief that biological 
parents are better equipped to care for their children. Probably, too, it reflects 
the idea that children are the property of their biological parents. 

These legal assumptions evolved from, and helped to sustain, a family 
system that was culturally and socially designed to reinforce the primacy of 
biological ties. Or, to put it differently, these assumptions emphasize ascriptive 
affiliations as opposed to earned or achieved affiliations. Throughout this 
paper, we have argued that the justification for giving such heavy weight to 
ascriptive ties is being seriously undermined by new patterns of family forma- 
tion. It may be time to reconsider the doctrine that family is largely determined 
by "blood" and to assign, rather, a higher importance to the emotional, social, 
and material resources that parents, biological and nonbiological, provide. In 
other words, perhaps we should regard parenthood as both an achieved and an 
ascribed status. Perhaps we should require that parents earn rights to their 
children by assuming responsibilities in caretaking and support. Of course, 
biological parents must be given an opportunity, indeed they must be expected, 
to assume those responsibilities. Perhaps, however, we should also accept the 
possibility that other parent-figures may supplant biological parents when 
circumstances permit or require their involvement (Woodhouse 1993). 

Of course, we are aware that multiple parent-figures complicate both the 
legal system and the parenting system. But the idea that two parents per 
household is the standard and the only acceptable family form is giving way 
to a more diverse set of family arrangements that are not so neatly confined 
to a single household. The change in family forms that we have been tracing 
introduces a host of anomalies. We see little evidence that remarriage (formal 
and informal) has become more institutionalized since the two of us first began 
to write about this growing phenomenon. We see some troubling indications 
that the cultural, legal, and social anomalies associated with "recycling the 
family" place a considerable burden on a growing number of children—even 
if most children seem capable of managing that burden without serious effects. 

To describe this new family system is not necessarily to endorse it. But 
neither can we say that the old order is invariably to be preferred. Moreover, 
even if we believed that the old order were preferable, we cannot imagine how 
it could be easily restored without considerable costs to all. 
Abstract 


While social theory and legal theory were once closely intertwined, contem- 
porary American sociology pays scant attention to recent developments in legal 
theory. But the problems that legal theory currently wrestles with are very 
similar to those with which sociology is now centrally concerned. This essay 
reviews major schools of thought in contemporary legal theory to introduce 
sociologists to some potentially useful literatures on the meaning of rationality; 
on critical theory; on the importance of gender, race, and class in understanding 
social institutions; on the interpretive turn; on the relationship between struc- 
ture and agency; and on the revival of pragmatism. 


"[S]ociology [is] the ghost of jurisprudence past.” 
Donald R. Kelley, The Human Measure (1990: 275) 


INTRODUCTION 


Annual Review articles customarily begin with a statement about the current 
salience of the field under review for the discipline of sociology. I cannot make 
any such claim. Legal theory (jurisprudence) is a field that is today largely 
unrecognized by sociologists, even sociologists of law. Why then devote an 
Annual Review paper to a field that is not prominently on the intellectual map 
of the discipline? 

One answer is that sociology as a distinct discipline grew out of jurispru- 
dence and maintained a close alliance with jurisprudence for most of the 
nineteenth century. Many early European social theorists studied law at 
university because sociology did not exist as an independent field of study. 
"Indeed," as historian James Q. Whitman has noted, "it is a striking fact that 
when modern social science finally appeared, it appeared not among theologians 
or philosophers, but among lawyers" (1991: 205). Understanding sociology's 
past requires understanding jurisprudence. 

Another answer might be that legal theory has the potential to be important 
to contemporary sociology because legal theorists and social theorists have 
relevantly similar agendas these days. The main object of legal theory is legal 
doctrine, the set of concepts and categories that law students learn in law 
schools: constitutional provisions, statutory enactments, administrative regu- 
lations, precedents. But sociology is itself increasingly focused on cultural 
products as well: representations, mentalités, texts, and images. The ways in 
which legal theorists think about legal doctrine may be of more than passing 
interest to sociologists who are thinking through cultural formations. 

But there is something even deeper about the contemporary connection 
between legal theory and social theory. Both try to reconcile the same tensions: 
between the written word and social practices, between structure and 
agency, between the normative and the descriptive, between formal elegance 
and descriptive adequacy. Just as the nineteenth century social theorists turned 
to law to see how society was organized, late twentieth century social theorists 
may look to law to find fellow travellers who are also trying to understand the 
complexity of the social world. 

That said, I should provide one note of caution to sociologists about to 
undertake extensive reading in jurisprudence. Legal theorists often find that 
the same questions sociologists ask need different sorts of answers. Legal 
theorists tend to go back and forth between descriptive and normative argu- 
ments more fluidly than most social theorists (though feminist social theorists 
are one notable exception to this). Many legal theorists also accept certain 
contingent features of legal systems as being more fixed than sociologists 
would be inclined to imagine, sometimes conveying the impression that courts 
are the only institutions in the universe or that American rules of civil procedure 
(for example) are as fixed as fortresses and as unremarkable as gravity. Nev- 
ertheless, as I hope to show in this essay, social theorists may well find that 
legal theory is quite relevant for answering pressing questions in contemporary 
sociology. 


JURISPRUDENCE AND SOCIOLOGY: A BRIEF HISTORY 


Marx, Durkheim, and Weber had strong and deep ties to the law; Marx and 
Weber received their university educations in law, and Weber spent seven 
years being miserable while practicing it. Durkheim wrote about law throughout 
his career. He believed that law was the preeminent example of a social 
fact and that the evolution of societies could be traced through the relative 
elaboration of civil and criminal law (Durkheim 1964, Lukes & Scull 1983, 
Hunt 1978, Grace & Wilkinson 1978). Marx thought that class was most 
importantly defined through the legally constituted relationship of ownership 
of property and that law could best be understood as an ideological formation 
(Cain & Hunt 1978, Collins 1984). For Weber, the modern age was charac- 
terized primarily by the triumph of rational-legal authority in which social 
institutions were progressively conquered by norms modelled on formal legal 
procedure (Weber 1978, Kronman 1983, Tronto 1984). 

What law represented for Weber, Marx, and Durkheim was not just a set of 
institutions and professions (though these were important in their own right) but 
also a system of ideas: rational legality, ideology, doctrine. Law was 
important because it provided an intellectual framework within which bureau- 
crats, capitalists, and common people thought about and acted in the social world. 

The idea that law was an important subject of academic study was common 
in nineteenth century Europe. The term "jurisprudence," both then and now, 
has at least two distinct meanings that are often intertwined: (i) the body of 
scholarship that theorizes about law (legal theory), and (ii) the set of rules, 
principles, and official pronouncements that constitute "the law" as a substan- 
tive field (legal doctrine). Nineteenth century social theory drew freely from 
both because jurisprudential writing at the time took for granted that legal 
doctrine should be the main object of legal theory. Legal scholars like Mon- 
tesquieu (1748), Maine (1864), Savigny (1829), and Gierke (1913) traced 
broad patterns in the history of legal doctrine to illuminate equally broad 
conceptual categories of legal theory. Marx, Weber, and Durkheim also ex- 
amined the historical development of law, believing that social theory should 
comprehend legal forms (Kelley 1990). 

It is no wonder that European social theory had its origins in legal scholar- 
ship. The nineteenth century continental social theorists were writing in a time 
that was enthusiastic about general codes. Reading the French Civil Code of 
1804 or the German Civil Code of 1900, one can see that legal doctrine is like 
a rough draft of social theory, comprising concepts, categories, rules and 
procedures for managing the vast array of human conduct in an orderly and 
systematic way. A comprehensive system of contract law reads like general 
social theory, for example. It specifies types of social actors, the ways they 
may interact with each other, what they may expect, what they may do when 
expectations are violated. Legal rules are the general principles for understand- 
ing and managing such interactions. The social theory that arose from famil- 
iarity with legal doctrine also engaged in systematic examination of social life, 
specifying the same sorts of things that nineteenth century legal codes did in 
a systematic and comprehensive way. 

American sociology, however, never had such strong formal ties to jurisprudence. 
Concern with law as an intellectual enterprise, so prominently featured 
in continental social theory, gave way in the new American urban 
sociology to the empirical study of deviance, criminality, and social control. 
Not that these things weren't important; they were crucial in understanding 
the problem of social order that had become central to American sociology. 
But once the historical study of law was transformed into the contemporary 
empirical study of criminal behavior, doctrinal ideas and legal ideals became 
less prominent in social theory. 

Within American legal scholarship, however, the new empirical sociology 
was influential. Oliver Wendell Holmes explicitly distanced himself from 
Continental abstract theorizing by urging his colleagues to focus on "our friend 
the bad man" for "we shall find that he does not care two straws for axioms 
or deductions, but that he does want to know what the Massachusetts or English 
courts are likely to do in fact" (Holmes 1920:172-73). Harvard Law School 
Dean Roscoe Pound launched a movement in sociological jurisprudence that 
concentrated on the relation between law and social interests, seeing the de- 
veloping social sciences as crucial in the understanding of legal forms (Pound 
1911-1912). The laissez-faire attitude of sociological jurisprudence eventually 
gave way to the New Deal theorizing of the Legal Realists, for whom theory 
took as its object the way law worked in practice, breaking down the intellec- 
tual walls between law, politics, society, and culture. While mainstream Amer- 
ican legal thought had previously emphasized the pure logic of abstract legal 
concepts through legal formalism, the Realists and their precursors were at 
pains to demonstrate that law was a human product contingent on time and 
place (Hunt 1978, Harris 1980). 

European social theory had an influence on the Legal Realists, but not 
through the mediation of American sociologists. Instead, Columbia law pro- 
fessor Karl Llewellyn, who discovered Weber’s writings on a trip to Germany 
in 1927 (Ansaldi 1992), provided one link (Llewellyn 1930, 1962). Weber had 
noted that “order will be called ... law if it is externally guaranteed by the 
probability that physical or psychological coercion will be applied by a staff 
of people in order to bring about compliance or avenge violation” (1978: 34), 
and a similar view of law echoes eerily through the American Legal Realist 
movement (Rumble 1968, Frank 1930, 1949, Cohen 1935). Llewellyn’s lec- 
tures to the entering class of the Columbia Law School in 1930 borrowed 
Weber’s emphasis on the importance of legal professionals in defining the law: 
“What [legal] officials do about law is, to my mind, the law itself” (1930: 3). 
This emphasis remains in socio-legal scholarship to this day. 

In the meantime, back in American sociology, European social thought 
reappeared through a different line of transmission. Through Parsons’ The 
Structure of Social Action and later through strategic translations of the works 
of Durkheim, Weber, Marx, and Simmel, American sociologists adopted the 
newly defined classics, but without the background to see just how these ideas 
resonated in European jurisprudence. Sociology appeared as a free-standing 
field with law as one possible subject; law was not seen as an intellectual 
tradition that had contributed in important ways to the development of social 
theory in the first place. 

Numerous attempts in this century to provide a theoretically ambitious 
general social theory based in jurisprudence have been largely ignored by the 
discipline (Ehrlich 1913, Timasheff 1939, Hall 1963, Gurvitch 1973, Aubert 
1969). More successful were attempts to link the sociology of law more directly 
to the classic writers: Durkheim's legacy was kept alive and extended in 
different ways by Schwartz & Miller (1964), Black (1989), and Evan (1990); 
Philip Selznick's discussion of the pervasiveness of norms of legality (1968) 
followed strong themes in Weber; Pashukanis (1929), Balbus (1977), Cham- 
bliss & Seidman (1982), and Thompson (1975) followed Marx. A group of 
scholars working in the 1960s and interested in combining social science and 
legal scholarship laid the foundation for what became the law and society 
movement, in which empiricism became the rallying cry (Friedman 1986). 
Demonstrating that "law in action" was different from "law on the books," 
early law and society scholars rejected normative jurisprudence and the doctrinal 
scholarship prevalent in law schools at the time, further estranging 
sociology of law from legal theory. 

From the 1970s on, however, theoretical ferment in law and in social science 
brought the agendas of jurisprudence and general social theory closer together 
again. Theorists like Coleman (1990), Bourdieu (1987), Foucault (1977), and 
Luhmann (1985, 1988-1989) explicitly turned their attention to legal forms. 
Sociolegal scholars like Lempert & Sanders (1986), Post (1988, 1989), Simon 
(1993), and members of the Amherst Seminar on Legal Process and Legal 
Ideology (Silbey 1985, Sarat & Felstiner 1986, Yngvesson 1989, Merry 1990) 
embraced social theory in the course of studying law. Recently, the North- 
western Law Review published a symposium issue on Law and Social Theory 
in which notable social theorists and legal scholars explored the connection 
(1988-1989, with overviews by Mertz 1988-1989, van Zandt 1988-1989 and 
Calhoun 1988-1989). Social theory is having an increasing effect in law review 
articles, and legal theory is increasingly seeping into sociological writing, 
particularly in law and rationality studies and feminist theory. 

But the overlap of potential mutual interest is even larger than this. In this 
essay I track some of the connections, outlining the major lines of work in 
contemporary legal theory: law and rationality studies, critical jurisprudence, 
the literary turn, discursive structuralisms and pragmatism. My discussion 
proceeds roughly chronologically, both across sections and within 
sections. 


LAW AND RATIONALITY STUDIES 


"Law and rationality" includes various schools of law and economics as well 
as the new public choice theories. They share a focus on individual choice and 
human rationality as the building blocks of social theory. Launched in the early 
1960s, this field has since become crucially important in American law, and 
during the Reagan years many prominent theoreticians in the law and econom- 
ics movement were appointed to the federal bench. 

The growth of law and economics has paralleled the growth in legally 
informed rational choice theories in sociology, particularly those of Coleman, 
Elster, and Buchanan & Tullock. Coleman’s The Foundations of Social Theory 
(1990) envisions social action as a system of rational decisions that both 
operate within an institutional framework and form the institutional framework 
itself. Coleman analyzes the way social context is derived from and influences 
the aggregation of individual decisions, through the legal ideas of sovereignty, 
rights, authority systems, and juristic persons. Elster uses his work on the 
limitations of a strict rationality model to analyze questions about constitu- 
tionalism and institutional design (1988), seeing constitutional drafting pro- 
cesses as a "gigantic natural experiment" for analysis of "bargaining, threats 
and warnings" (1991:447). Buchanan & Tullock (1962) applied public choice 
theory to the law. These approaches have a strong affinity with law and 
rationality studies, which divide into several schools. 


The Chicago School 


When Coase first published The Problem of Social Cost (1960), the field of 
law and economics was barely a blip on the intellectual map. Coase argued 
that in a world with no transaction costs, the initial assignment of property 
rights was irrelevant to the final distribution of property. And despite the fact 
that this looks like an argument that law makes no difference, the article is 
widely credited with launching the field, particularly the version associated 
with the University of Chicago and the Journal of Legal Studies. 

The Chicago school of law and economics evaluates legal rules on the basis 
of their abilities to promote the efficient use of scarce resources and, in some 
versions, to produce the greatest amount of wealth in the society as a whole 
(Posner 1992). Legal rules should, and common law rules largely do, mirror 
what would happen if an error-free market could determine the outcomes, 
according to Chicago school theorists. Their approach is relentlessly individ- 
ualistic, using Gary Becker's particular version of microeconomics for most 
of the theoretical framework (Becker 1976). It is also very court-centered, 
focusing on legal doctrine as it has been elaborated by judges as the main 
object of study. Most of the time, the concrete results of the Chicago school 
writers are consistent with a libertarian conservatism and, not surprisingly, 
the approach has generated a large critical literature (Leff 1974, Scheppele 
1988), attacking it for being insufficiently sensitive to the variety of values in 
the law. 


The Yale School 


Guido Calabresi's book The Cost of Accidents (1970) launched the Yale school 
much the same way that Coase launched the Chicago school. Generally, a 
group of writers clustered around the Yale Law School and around the Journal 
of Law, Economics and Organization has been concerned with questions of 
institutional design (Williamson 1975), choices of legal rules (Calabresi & 
Melamed 1972), and ethical issues in institutional arrangements (Coleman 
1988), rather than with working out ideal results from free-market assumptions. 
The approach tends to be quite institutional and realistic, focusing in particular 
on the relationship between the administrative state and economic formations. 
In contrast with the Chicago School, which views government largely as a 
pathology interfering with well-functioning markets, the Yale School sees 
public institutions as one organizational form among many which might handle 
a particular social problem. Rose-Ackerman, for example, argues for a reformist 
theory of the administrative state that starts from welfare economics rather 
than from microeconomics (1988). Such an approach generates a concern for 
social effects as well as individual effects. 


The Legal Incentives School 


Economists in this group focus on the effects of legal rules on the incentives 
of individuals who are engaging in rational action. As evidenced in the work 
of Shavell on tort law (1987), Mnookin & Kornhauser (1979) on bargaining 
in divorce cases, and in the Polinsky (1989) and Cooter & Ulen (1987) text- 
books, scholars in this group use rational choice models to derive the expected 
effects of alternative legal rules, and then use welfare economics to assess 
which of competing legal rules produces the most desirable results. Applying 
the methods of public policy analysis to the study of legal rules, economists 
of this sort take law to be one of many sources of incentives, constraints, and 
opportunities. 


The Public Choice School 


Starting with Buchanan & Tullock (1962), public choice theory examines the 
collective consequences of individual choice. In legal circles, this traditionally 
meant thinking about the ways in which individual rationality produced col- 
lectively irrational results. But public choice has entered the theoretical fray 
in law schools most recently with the increasing concern about the rationality 
of the legislative process and its impact on statutory drafting and interpretation 
(Farber & Frickey 1991), the rationality of administrative agency decisions 
(Eskridge & Ferejohn 1992), and the rationality of judicial review (Ferejohn 
& Weingast 1992). Critics have complained that these theories fail to take into 
account important differences across institutions (Rubin 1991), but the ability 
to use a theory that applies to all institutions regardless of difference is one of 
the main strengths, proponents claim. 


Law and rationality studies have had an enormous impact on law. Many 
practitioners of this line of scholarship are now federal judges, applying their 
theories in the course of their judicial decisions. Virtually every major law 
school has an economist on the faculty to teach economics to law students. 
And law and rationality studies have had an important influence in rational 
choice theories in sociology, where law is now more prominently featured as 
an institutional constraint. In this area, at least, law and sociology are on quite 
good terms with each other. 


CRITICAL JURISPRUDENCE 


Politically and methodologically the opposite of law and rationality studies, 
critical jurisprudence is also flourishing in law schools. Critical jurisprudence 
comes from the academic left, parallelling the growth of cultural studies 
(Grossberg et al 1992) and producing an enormous range of writings on law 
and oppression and on ways of making legal institutions more democratically 
responsive. Critical jurisprudence embraces a far-flung and large range of 
scholars who identify themselves as being engaged in critical legal studies, 
feminist theory, and/or critical race theory. Many critical theorists identify with 
more than one of these labels, and much exciting intellectual activity these 
days is happening at the boundaries (Menkel-Meadow 1988, Harris 1990, 
Rhode 1990, Crenshaw 1989, Torres 1988, Trubek 1984, Johnson 1991). 


Critical Legal Studies (CLS) 


Starting as a social movement within American law schools in the late 1970s, 
the group called "The Conference on Critical Legal Studies" created an intellectual 
home for radical law teachers and by one count in 1989 claimed over 
700 articles (Hutchinson 1989). Critical legal studies (CLS) is "influenced by 
a variety of currents in contemporary radical social theory, but [CLS] does not 
reflect any agreed upon set of political tenets or methodological approaches" 
(Kennedy & Klare 1984: 461). Nonetheless, certain critical themes emerge 
that capture the spirit of much CLS writing. 


ATTACK ON LIBERAL LEGALISM Many CLS writers believe that liberal legal 
theory, with its conceptions of rights, neutrality, and procedural justice, is an 
ideological cover for decisions governed by power and the maintenance of 
inequality (Unger 1986). By speaking in abstract terms, liberal legalism disguises 
its own fictions: that rules can decide concrete cases, that legal subjects 
are autonomous individuals, that intentions are sufficient to explain social 
action (Kelman 1987). Liberal legalism is exposed in CLS writings as a 
justification machine that serves primarily to reproduce social inequality. (But 
at least one critic complains that the liberalism CLS attacks is not a version 
most liberals are defending—Herzog 1987). 


THE INDETERMINACY THESIS When judges decide cases, they claim that their 
results are compelled by the law. But actually, CLS writers argue, law consists 
of a whole variety of contradictions and inconsistencies, allowing decisions to 
go either way. Law is logically indeterminate and fails to compel a particular 
result (Singer 1986). Therefore, judicial decisions cannot be the self-contained 
models of reasoning that they pretend to be. Instead, they must rest on grounds 
outside of formal legal doctrine, grounds which are inevitably political (Ken- 
nedy 1979, Dalton 1985, Tushnet 1983, Klare 1978). 


THE RELATIVE AUTONOMY OF LAW Having unmasked (or "trashed") liberal 
legal theory and shown that cases cannot be compelled by doctrine, CLS 
writers argue that law is inevitably tied to politics. Borrowing heavily from 
both Marx and Weber, critical legal scholars have elaborated a view that law 
is "relatively autonomous," meaning that although law uses a special form of 
argumentation that makes it distinct from ordinary politics, it always serves 
political interests and purposes (Kairys 1990, Gordon 1984). The framework 
of relative autonomy helps to explain why law may appear to be a self-con- 
tained logical system—it does develop according to its own internal rules of 
operation—but also why law must always be tied to the political. 


LEGAL CONSCIOUSNESS Law operates by producing in those subject to it or 
trained in its use a specific legal consciousness. CLS writers have analyzed 
both the way in which law sets the terms within which the world is seen and 
the efforts of those dominated by such visions to escape from these ideologies 
(Kennedy 1980, Klare 1978). 

These themes combine to form a basis for a critical legal practice, devoted 
to assisting the oppressed and to challenging power, particularly when it 
masquerades as inevitability. CLS scholars unmask the apparent neutrality of 
legal premises and show precisely how legal ideas and ideals are constantly 
being reconstructed to hide their own agendas. And CLS scholars often show 
the subaltern, subordinated discourses present alongside the authorized ones. 


Feminist Jurisprudence 


Feminist jurisprudence is centrally concerned with the influence of gender and 
gendered conceptions of the world on law and vice versa. Since law is a crucial 
site in which the fight for women's equality has been carried out, feminist 
jurisprudence has been important in the development of general feminist theory 
outside of law. Recent collections reveal the impressive vibrancy of feminist 
jurisprudence (Bartlett & Kennedy 1991, Weisberg 1993, Frug 1992a, Goldstein 
1992). 

Most feminists share the view that society is patriarchal, organized and 
dominated by men, and therefore not necessarily hospitable to women. This 
dominance can be seen in the way in which gender (the social meaning given 
to biological sex differences) is employed to mark hierarchy. The role of gender 
is particularly crucial in law since law regulates all other institutions in a 
society. Starting from important analyses of areas of the law that were most 
obviously about women, including abortion (Ginsburg 1985, Law 1984, Siegel 1992), 
rape (Estrich 1987, Olsen 1984), domestic violence (Mahoney 1991), pregnancy 
(Williams 1984-1985), sexual harassment (MacKinnon 1979), employment 
discrimination (Schultz 1990), work and family issues (Olsen 1983), divorce 
(Fineman 1991), sexual orientation (Robson 1992), child custody (Fineman 
1988), and pornography (Dworkin 1981), feminist legal theory is now seeing 
the ways in which law is gendered all the way down, from topics not tradi- 
tionally thought of as presenting feminist issues, like contracts and torts (Dalton 
1985, Bender 1993), to legal education (Menkel-Meadow 1988), to legal 
methodology (Bartlett 1990), to legal practice (L White 1990, Davis 1991, 
Scheppele 1992). Feminist legal theory is not only about women; feminist 
legal theory is now a general theoretical approach to law. 

So what does this approach include? Feminists disagree too much to have 
a common set of answers; what they share is a common framework within 
which feminist debates occur, emphasizing equality and difference, relations 
of power, complexities of social position, and subjectivity. They also share a 
concern with the common question: How can women's situation best be 
bettered? 


EQUALITY VS DIFFERENCE On this question, feminists divide between those 
who favor treating women equally (in other words, just like men) (Ginsburg 
1985) and those who favor treating women differently from men, in order to 
ensure equality in results (Littleton 1987, Minow 1990). Breaking new ground, 
"equality feminists" first achieved a series of legal victories in the areas of 
employment discrimination. This ultimately proved unsatisfying, however, 
when other barriers, like women's unique reproductive capacities, could not 
be handled within a strict equality framework. If women had to be treated just 
like men, then how could they be given maternity leave or pregnancy benefits? 
“Difference feminists” then argued that formal similarity of treatment would 
never be enough, because women and men are relevantly different. Such 
differences needed to be taken into account in fixing legal rules, “difference 
feminists" argued, or these rules would simply reproduce inequality. Equality 
could be achieved through differential treatment. 


DOMINATION AND SUBORDINATION Into this debate over equality and differ- 
ence came another argument: perhaps what mattered was not the formal rela- 
tion of comparison between men and women, but rather their power relative 
to each other. Men dominated; women were subordinated. From this, argued 
Catharine MacKinnon, one could attack the means through which men's domination 
is achieved: control over sex (MacKinnon 1989). In domination theory, 
the task of feminism is to end the subordination of women, not simply to make 
men and women "equal" in some abstract sense (Scales 1986, Colker 1986). 
Everyone could be equal and oppressed; the task of dominance feminism was 
to end the oppression. 


ESSENTIALISM VS ANTIESSENTIALISM But while difference and domination 
were becoming the central analytical tools of contemporary legal feminism, 
other differences and relations of domination among women themselves began 
to become apparent. Both dominance and difference theorists were often ac- 
cused of being essentialist—that is, of defending an idea of woman's "essence" 
that was the same for all women. Though few feminists overtly defend essen- 
tialism, something like essentialism was presumed in early feminist theory as 
a way of sharpening the primary contrast between men and women. Anti- 
essentialist theory celebrates difference among women and refuses to see 
"woman" as a unitary social category. This has clearly become the more 
dominant view in recent years within feminist legal theory, as feminists have 
examined questions of race, ethnicity, and sexual orientation (Harris 1990, 
Robson 1992, Matsuda 1989a, Crenshaw 1989, Spellman 1988). 


SUBJECTIVITY AND RELATIONALITY Feminist legal theorists are also now exploring 
different models of thinking about legal subjects (i) as requiring connections 
with others rather than seeking separation through claims of autonomy 
(West 1988, Minow 1990, Nedelsky 1989), (ii) as defined by their subjectivity 
rather than by objectivist conceptions (Matsuda 1989b, Williams 1991), and 
(iii) as finding that the experience of subordination (Menkel-Meadow 1988) 
or the sense of an embodied self (West 1988) leads to special insight. These 
feminists are turning what were once thought to be liabilities of women into 
sources of strength and illumination, and in so doing, they are reconfiguring 
the subject of law. 


Critical Race Theory 


Like feminist legal theory and often closely aligned with it, critical race theory 
is centrally concerned with questions of oppression, difference, and equality. 
Critical race theory had its official beginning in 1989 when a group of scholars 
of color held a conference in Madison, Wisconsin. Many of these scholars had 
been previously involved with critical legal studies or with feminist jurispru- 
dence, and the 1989 conference ratified what had already been the case for 
some time: Critical race theory was a major presence in American legal theory. 
Critical race theorists are a diverse group, speaking about many different 
areas of law in many different voices. Some central themes can be found, 
however, themes highlighted below in the words of critical race theorists. 


“LOOKING TO THE BOTTOM" (Matsuda 1989a): Critical race theorists gener- 
ally start from the observation that to be a person of color in America is to be 
oppressed, to have one's subjectivity colonized by others who either silence 
the opposition of persons of color or speak in their name. Being oppressed 
creates fundamental disadvantages for those who are so treated. Critical race 
theory must start from the bottom, by focusing on the experiences and situa- 
tions of oppressed people of color, giving voice to the concrete experiences 
of subalterns (Matsuda 1989a, Peller 1990, Johnson 1991). 


"WE SEE A DIFFERENT WORLD..." (Lawrence 1990): Because they share ex- 
periences of oppression, people of color see the world differently from those 
who have not had such experiences. With different vision and different voice, 
scholars of color can bring to legal analysis perspectives that were previously 
excluded (Lopez 1987, Bell 1987, 1992, Williams 1991). 


“A PLEA FOR NARRATIVE" (Delgado 1989) To bring these excluded perspec- 
tives to the law, some critical race scholars tell stories about their experiences 
or the experiences of other people of color to make their presence real in legal 
scholarship (Bell 1987, 1992, Williams 1991, Scales-Trent 1990, Matsuda 
1989b). Overcoming the abstraction of conventional legal analysis and the 
sanitized versions of facts presented in court decisions, the highly personal 
stories in these articles break through alleged neutralities. 


"EXPLAIN THE SOURCE AND STRENGTH OF MINORITY CONVICTIONS THAT COURTS 
... ARE CAPABLE OF BIAS" (Davis 1989: 1559): Some critical race theorists 
trace the ways in which law undermines people of color, despite official 
rhetoric to the contrary. Lawrence explains the extraordinary pervasiveness of 
unconscious racism and shows how it is ignored by courts (1987). Austin traces 
the material conditions of young women of color and shows how courts fail 
to recognize them (1989). Davis shows how African Americans interpret legal 
decisions as “microaggressions” whose cumulative effect is large (1989). Har- 
ris analyzes the way that whiteness has been treated as a property right, 
excluding people of color from making privileged claims (1993). Matsuda has 
identified “accent discrimination" by showing how the comprehensibility of 
an accent depends on the hearer as much as the speaker, but courts have not 
heard this yet (1991). 


“MULTIPLE CONSCIOUSNESS AS A JURISPRUDENTIAL METHOD" (Matsuda 
1989a) People of color have an ambivalent relationship to law. Sometimes the 
law helps overcome discrimination, as when Williams reports that having a 
formal contract can limit the arbitrariness to which people of color are exposed 
(Williams 1991). But other times the law fails miserably. Matsuda counsels 
multiple consciousness, being able to think both inside and outside the law. 
Crenshaw also proposes a multiplicity of identifications in dealing with the 
combined effects of gender and race (1989). Such accumulated scholarship 
shows the profound and indelible mark of race in the law. 


Critical race theory has come in for criticism (Kennedy 1989), but as a 
recent bibliography demonstrates, it is more than alive and well (Delgado & 
Stefancic 1993). 


THE LITERARY TURN 


Like many disciplines in the 1980s, law reeled from the invasion of literary 
theory. The new literary theory hit law hard because it challenged fundamental 
assumptions about the stability of legal doctrine. In the standard view, law was 
supposed to govern by constraining the future with words written in the past. 
But the new literary theory showed that all texts, including legal ones, were 
unstable in their meanings, either because writing always represses what it 
cannot control (Derrida 1976) or because meaning varies with the relevant 
“community of interpretation" (Fish 1980). Of course, these observations were 
broadly consistent with what critical legal studies, feminist, and critical race 
scholars were also writing about: that claims to objectivity and stability of the 
law were built on the power of law to suppress alternative viewpoints. The 
theoretical destabilization of texts and the debates that resulted can be dis- 
cussed in the general categories of legal interpretation, postmodernism, and 
narrative jurisprudence. 


Legal Interpretation 


If, as literary theory demonstrated, texts were unstable, then how could they 
be shored up again to provide authoritative interpretations? Of course, critical 
legal scholars were saying that such a thing could not be done at all (Tushnet 
1983, Levinson 1982). But other legal theorists tried to come up with ingenious 
ways to stabilize one of the embarrassing multiplicity of apparently "correct" 
interpretations any legal text could produce. Each of these efforts resulted in 
bringing in something from outside the text to accomplish this purpose, and 
this is where law must be seen as caught in a larger web of cultural processes. 
Law could no longer be officially portrayed as a closed system of logic immune 
from social influences. 

Much of the fight occurred over interpretation of American law's master 
text: the US Constitution. Reagan's Attorney General Edwin Meese argued 
that the only responsible way for a court to interpret the Constitution was by 
reference to the intentions of the framers (Meese 1988, but see Brest 1980). 
Supreme Court Justice William Brennan argued that a court must inevitably 
engage contemporary concerns: "We current justices read the Constitution in 
the only way that we can: as twentieth-century Americans" (Brennan 1988: 
17). Into this debate plunged some substantial fraction of all jurisprudential 
writing published in the 1980s. 

What resulted was a vast catalogue of different strategies of reading and 
understanding texts. In the 1950s, Wechsler had argued that courts should 
decide constitutional questions by reference to neutral principles that could be 
defended regardless of the particular dispute at hand (1959). And Black and 
Ely tried to argue that one need not look outside the text at all. Staking out 
“structuralist” positions (albeit different ones), they argued that the meaning 
of individual clauses in the Constitution could and should be derived either 
from the internal structure of the overall document or the sense that could be 
made of the vision of government and society that such a text contained (Black 
1969, Ely 1980). Also trying to stay close to the text, Schauer noted that the 
plain language of constitutional provisions could convey meaning unprob- 
lematically to ordinary readers most of the time (1985). 

But an assault on these strategies of reading was mounted by a set of authors 
who were concerned with specifying the social context under which language 
could be read like that. Fish argued that the idea of a single literal language 
and a recourse to the text alone only fools the interpreter, since the meaning 
of words is always supplied by context. While in some circumstances the 
context may be taken for granted enough to be unproblematic, it is always 
socially situated in a particular “community of interpretation” that ultimately 
judges the "reading" to be reasonable or not (1980). Fiss also located grounded 
readings in the larger community, in the invocation by a judge of public values 
(1982). Dworkin argued that the relevant context could be constructed by a 
judge who first developed a justifiable political theory and then read a legal 
text in light of that theory (1986). And Sunstein argued that one needs to situate 
texts in the context of the political process that gave rise to them to understand 
how they might be read (1990). Levinson provides a useful and humorous 
summary of this literature in his analysis of the "adultery clause of the Ten 
Commandments" (1985). 

In the meantime, other theorists were taking a more cultural view of the 
interpretation issue. Rather than trying to justify particular answers to particular 
legal questions, interpretive theorists developed accounts of interpretive practice 
as an ongoing activity of legal and other social institutions. Starting from 
the stark proposition, “We inhabit a nomos—a normative world,” Robert Cover 
examined the way in which any community contains within it multiple com- 
munities of normative meaning. Legal interpretation links general nomos with 
particular narrative, but in so doing judges flatten the normative diversity of 
the social world through exclusion of alternative perspectives (1983). Later, 
Cover showed how the interpretive acts of judges could be seen as acts of 
violence since they produced punishment, disruption, and violence in the world 
(Cover 1986, Sarat & Kearns 1991, 1992). 

Other cultural theorists examined how legal representation and broader 
cultural images both constitute and resist each other. Geertz showed the inter- 
pretive fluidity around central legal conceptions in different cultures (1983). 
JB White explored the disjunctures that result when one system of thinking is 
translated into another (1990). Scheppele showed how the construction of facts 
is always an interpretive enterprise (1990). Humphreys showed how much of 
law could be seen as discourse (1986). The interpretive revolution ran all the 
way through legal scholarship. 


Postmodern Legal Theory 


Postmodernism can be described as a theoretical stance that attacks modernism. 
As an intellectual project, modernism creates order, systems, structures and 
plans through conceptual neatness and appeals to shared values. Postmodern- 
ism is, by modernist standards, messy. It separates images from the objects 
that such images claim to represent and in so doing denies the possibility of 
finding truth; it combines cultural shards, textual pieces, imaginaries. Post- 
modernism defies logic. It, perhaps fittingly, also defies common definition 
by those who claim to practice it (Schanck 1992). 

In some legal postmodernist writing, modernism is equated with "the En- 
lightenment project" of liberalism, and so postmodernism becomes a critique 
of liberalism (Schlag 1990). Some postmodernist legal theorists emphasize the 
social transformation that has broken apart modernist orderings, creating 
a postmodern condition to which legal interpretation must respond (Balkin 
1992). Others emphasize the impossibility of creating legitimate legal inter- 
pretations in the face of the now-demonstrated instability of texts (Hunt 1990). 
As it is used in legal theory these days, postmodernism tends to be a critical 
discourse. As a result, postmodern scholarship is allied with some aspects of 
critical legal studies (Peller 1985), feminism (Cornell 1991, Patterson 1992, 
Frug 1992b), and critical race theory (Thomas 1992), in addition to bringing 
in strong strands of sociological constructivism (Santos 1987, Eisenstein 1988). 
Even Derrida himself has turned his attention to the deconstruction of law 
(Derrida 1992). 
Narrative Jurisprudence 


Narrative jurisprudence understands legal discourse and the discourse of legal 
subjects as stories (Scheppele 1989). Recent social theory indicates that people 
interpret what happens to them narratively by fitting events into story structures 
(Bruner 1991, Carr 1990). Courts, too, make sense of things as stories, since 
they hear cases as elaborations of particular events through story forms (Hastie 
& Pennington 1991). 

Narrative jurisprudence has two distinct strands. In one, linked to some 
versions of feminism and critical race theory, narrative is the method through 
which oppressed groups and individuals make their experiences visible in law. 
“Outsider jurisprudence” (Matsuda 1989b) involves women and people of 
color telling stories of oppression, sometimes as parables (Bell 1992), some- 
times as personal revelations (Williams 1991, Ashe 1989, Scales-Trent 1990), 
sometimes as a strategy of empowerment (Delgado 1989, Mahoney 1991), 
sometimes as a guide to legal practice (L White 1990), sometimes to reform the 
law (Matsuda 1989b). These stories are consciousness-raising in the legal 
literature, making experiences visible that might otherwise not be known to a 
community that is predominantly white and male (Abrams 1991). 

The other strand of narrative jurisprudence analyzes stories that have ap- 
peared in legal settings, sometimes to work out what makes some stories more 
believable than others (Bennett & Feldman 1981, Conley & O’Barr 1990, 
Scheppele 1992), sometimes to use them as a way of exploring the social 
context that makes these narratives compelling (Foucault 1975, Ginzburg 1983, 
Guha 1987, Davis 1987), sometimes to identify their standard forms (Sherwin 
1988, Lopez 1984). Bringing methods from the humanities into the analysis 
of legal stories, narrative theorists try to move away from abstraction to the 
concrete experiences of particular individuals (Elkins 1985, West 1985). 

Stories add in the vividness of detail and context what they may lose in 
representativeness. Some writers find this disturbing, claiming that the focus 
on the potentially idiosyncratic individual case obscures larger social patterns 
(Farber & Sherry 1993, Tushnet 1992). 


DISCURSIVE STRUCTURALISMS 


Social theorists have been making structuralisms more open to discursive 
influence through a new emphasis on the social construction of meaning. 
Where old structuralisms envisioned society as a set of rigid structures off 
which people bounced like tennis balls, the new discursive structuralisms see 
society as a dynamic interplay between meaningful action and more enduring 
social formations. Bourdieu, Giddens, Foucault, and (in one reading, at least) 
Luhmann have been influential in legal theory by providing a way to think 
about connections between legal knowledge and social structure. 
Practice Theory 


Developed through the works of Bourdieu and Giddens, practice theory in- 
volves tracing the connections between the daily practice of social agents and 
the larger structures that those practices constitute, resist, and change. Bourdieu 
is centrally concerned with the reproduction of social structures through the 
meaningful activity of agents. From the standpoint of agents, social structures 
appear as a set of inevitabilities, the naturalized patterns of thinking and doing 
that constitute a "habitus." But agents create these structures over time by 
reproducing meaning with the imprint of power (Bourdieu 1977, Bourdieu & 
Wacquant 1992). In his analysis of the “juridical field," Bourdieu shows how 
legal concepts and categories form a habitus within which legal practitioners 
engage the social world and within which struggles over meaning and power 
are contained (Bourdieu 1987). 

Giddens elaborates a view of the "duality of structure," envisioning social 
structures as both the precondition and result of social practices. Constraining 
and enabling human agency, social structures comprise generative rules and 
relationships that social actors use strategically to make their way in the world 
(Giddens 1979). 

Practice theory is starting to catch on among legal theorists, sometimes within 
a critical legal studies framework and sometimes outside. Coombe (1989), Peller 
(1985), Boyle (1985), and Francis (1986) are working out applications of 
practice theory in areas as diverse as sexual violence and debt collection. 


Genealogical Inquiries 


According to Foucault, genealogies are "histories of the present" that re- 
construct the past, showing how knowledge of the “real” and the “inevitable” 
changes over time. For Foucault, power is inextricably intertwined with knowl- 
edge, so that “discourses” (relatively autonomous fields of representation and 
belief) both shape struggles for power and compete among themselves for 
dominance in the social world. Law, for Foucault, is a historically situated set 
of discourses that produce conforming subjects (Foucault 1977, 1978, Hunt 
1992). 

Foucault's work has produced a growing literature in sociolegal studies that 
is simultaneously sympathetic and critical. Simon's work on the uses of parole 
(1993), Constable's book on the genealogy of the mixed jury (1994), and 
Garland's book on theories of punishment (1990) are particularly creative 
genealogies of legal practices. 


The Amherst School 


The Amherst Seminar on Legal Ideology and Legal Process has combined 
elements of practice theory with Foucauldian insights about the discursive 
construction of legal consciousness (Silbey 1985, Bumiller 1988, Ewick & 
Silbey 1992). Members of this interdisciplinary seminar have concentrated on 
the way in which legal ideologies, the concepts and categories of legal thought, 
are often negotiated in the process of application (Mather & Yngvesson 1980— 
1981, Sarat & Felstiner 1986, Harrington 1985). They recognize that legal 
ideology is found not just in official legal settings like courtrooms, but it is 
also mobilized and constituted in nonlegal settings as well (Merry 1990, 
Yngvesson 1989, Brigham 1987). And they explore these processes by doing 
ethnographies, collecting stories, and following the traces of legal discourse 
in everyday life as well as traces of everyday life in legal discourse. 


Autopoiesis 


Niklas Luhmann at first glance looks like a structural functionalist with his 
emphasis on systems theory, but on closer reading some of the same discursive 
structuralist tendencies are apparent. In A Sociological Theory of Law, Luhmann 
describes law as the institution in society that deals with disappointed 
expectations, both normative and descriptive (Luhmann 1985). When viola- 
tions occur in what one expects would be done or in what one believes should 
be done, law is the institution that remedies disappointed expectations by 
rewriting the world. Most of those who have been influenced by Luhmann 
focus on the way in which law is constituted as a relatively autonomous system 
in a differentiated society, emphasizing systemic aspects of law (Teubner 
1988). But Luhmann may also be read to say that law is an institution that 
mediates between expectations and potential future states, between cultural 
formations and material circumstances. 


LEGAL PRAGMATISM 


Perhaps exhausted by all of this theorizing, legal scholarship has turned to 
pragmatism (Brint & Weaver 1991). Pragmatism eschews large conceptual 
schemes in favor of contextualized knowledge. It starts from the present "here 
and now," rather than from imagined neutral places and times. It goes some- 
where that will make a difference, doing only what is necessary to solve 
practical problems at hand. It is antifoundationalist, believing that knowledge 
has only the organization we bring to it and that the search for first principles 
inevitably turns up nothing very useful. To the pragmatist, truth comprises 
those things we know that hang together with everything else we believe to 
be the case. Theory is not what is done on special occasions; theory is what 
each of us does all the time to make sense of things. 

Law, to the pragmatist, is not a pure logical system, but a set of practices: 
situated, instrumental, sensitive to time and place (Grey 1989, Minow & 
Spellman 1989). Judges should decide cases without grand schemes, but with 
sensitivity to the local context of each case (Farber 1988). Legal theorists 
should stop straining at abstractions and should examine history and culture 
to understand how law works. And many legal theorists have joined the 
pragmatist cause, creating alliances between pragmatism and feminism (Radin 
1989), the left generally (Baker 1992), and most schools of legal thought. In 
fact, one commentator noted, "it seems only a slight exaggeration to suggest 
that a movement which five years ago included almost no one today appears 
to embrace virtually everyone" (Smith 1990: 409). 


AND SOCIOLOGY? 


This necessarily brief survey of a wide range of legal theorizing should con- 
vince sociologists that many of the same intellectual currents that preoccupy 
sociologists are also central to jurisprudence. But it should also say more. 
Social theory has its roots in legal theory, and the agendas of classical social 
theorists are tied up with jurisprudence. Contemporary social theory is strug- 
gling with many of the same issues as contemporary legal theory is, but now, 
with the differentiation of disciplines, each field is more likely to go it alone. 
If legal theorists and social theorists followed the developments in each other's 
disciplines more closely, we each might avoid some blind alleys that the other 
discipline has already discovered, and we might learn something in the juxta- 
positions that would otherwise go unnoticed. 
systems, neural networks, genetic algorithms, and classifier systems. The first 
major accomplishments of artificial social intelligence (ASI) have been in the 
realm of theory, where these techniques have inspired new theories as well as 
helping to render existing theories more rigorous. Two application areas for 
which ASI holds great promise are the sociological analysis of written texts 
and data retrieval from the forthcoming Global Information Infrastructure. ASI 
has already been applied to some kinds of statistical analysis, but how com- 
petitive it will be with more conventional techniques remains unclear. To take 
advantage of the opportunities offered by ASI, sociologists will have to become 
more computer literate and will have to reconsider the place of programming 
and computer science in the sociological curriculum. ASI may be a revolu- 
tionary approach with the potential to rescue sociology from the doldrums into 
which some observers believe it has fallen. 


INTRODUCTION 


Broadly defined, Artificial Social Intelligence (ASI) is the application of ma- 
chine intelligence techniques to social phenomena. ASI includes both theory 
building and data analysis. At the theory building level, ASI includes the 
computer simulations of societies or groups of organizations in which the 
individuals are modelled as intelligent actors. At the analysis level, ASI in- 
cludes the use of AI techniques for intelligently searching and analyzing data. 

A decade has passed since the first conference on sociology and artificial 
intelligence was held (Gilbert & Heath 1985), yet sociologists have made 
relatively less use of AI than have practitioners of other behavioral sciences 
(Anderson 1989). Certainly, there is a faddish quality to much AI work, and 
much publicity has been given to computer programs that achieve only the 
pretence of intelligence, not its substance. Another factor may be that sociology 
has placed less emphasis on cognition in recent decades than on social struc- 
ture, outside of areas like symbolic interactionism and the sociology of knowl- 
edge where use of computers is relatively undeveloped. The new interdisci- 
plinary field of Cognitive Science is rooted in AI and draws upon five more 
traditional fields: psychology, philosophy, linguistics, anthropology, and neu- 
roscience (Heckathorn 1989). For whatever reason, sociology is notably absent 
from this list. 

Traditionally, computer scientists working on AI have ignored the social 
roots of human intelligence. In the past five years, however, there has been an 
upsurge of interest among these researchers in the social side of intelligence. 
Areas such as distributed artificial intelligence (Gasser 1991), coordination 
theory, and collaboration technology (all with strong roots in engineering or 
computer science) have begun to look at social issues. To be sure, many of 
the early programs allowed computers to engage in natural-language conver- 
sations with humans, but the programming challenge was always to simulate 
the behavior of a single human actor. Although AI workers paid some attention 
to selected schools of thought within psychology, they ignored sociology. 
Randall Collins (1992) argues, however, that artificial intelligence cannot 
really be achieved without help from sociologists, and Allen Newell (1990) 
makes a similar point in his ground-breaking book, Unified Theories of Cog- 
nition. 

We begin with a review of the chief technical approaches that have been 
developed in artificial intelligence, providing just enough description so the 
reader can see the possible relevance for sociology; we cite a very few recent 
references that can provide a deeper introduction. Then we consider the chief 
areas of theory and empirical research in which AI has relevance for the social 
sciences, citing sociological work when possible but also identifying accom- 
plishments in neighboring social sciences that may foreshadow future socio- 
logical developments. 


TECHNIQUES OF ARTIFICIAL INTELLIGENCE 


At a first approximation, research in computer intelligence has taken one of 
two diametrically opposed approaches, which may, somewhat crudely, be 
called the “top-down” and “bottom-up” strategies. Until recently, most prom- 
inent AI researchers have focused on high-level symbolic processes that reflect 
the complex thought processes of which humans are capable. In contrast, others 
have attempted to model the basic functioning of nets of biological nerves, 
like those in relatively dumb simple organisms, with the hope that eventually 
they could work their way up to the level of human consciousness. 

A major difference between the two approaches is in the use of words. 
Symbolic process models conceptualize the world using a predominantly ver- 
bal framework. Procedures (if-then rules) and words are used to describe all 
behavior and to take all actions. In contrast, "neural models" (and those of 
similar ilk) conceptualize the world in a predominantly numeric framework. 
Equations and numbers are used to describe all behavior and to take all actions. 
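
The contrast can be made concrete with a minimal sketch. It is offered only as an illustration: the function names, the rule, the weights, and the threshold below are all hypothetical and are not drawn from any system cited here. The first function reaches a decision through a verbal if-then rule over named symbols; the second reaches a decision through a weighted sum of numbers, in the manner of a single artificial nerve cell.

```python
# Hypothetical illustration: one yes/no decision made in two styles.
# Nothing here reproduces a particular published AI system.


def symbolic_decision(facts):
    """Top-down style: an if-then rule stated over named symbols."""
    if "friend" in facts and "in_need" in facts:
        return "help"
    return "ignore"


def numeric_decision(inputs, weights, threshold):
    """Bottom-up style: a single threshold unit computed from numbers."""
    activation = sum(x * w for x, w in zip(inputs, weights))
    return 1 if activation >= threshold else 0


# Example calls with assumed facts, weights, and threshold.
print(symbolic_decision({"friend", "in_need"}))
print(numeric_decision([1.0, 1.0], [0.6, 0.6], 1.0))
```

In the symbolic version the knowledge is legible in the rule itself; in the numeric version it resides in the weights, which say nothing in words about why a decision was reached.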

Although some work on these “neural nets" goes back to the 1950s, this 
approach was eclipsed by the symbolic approach for almost 30 years (Crevier 
1993). In part, this was due to work by Minsky & Papert (1969) that demon- 
strated the limitations of one type of numeric-networked system. Misinterpre- 
tation of their results, changes in scientific paradigms, and changes in funding 
sources all played a role in reducing interest in such systems. Starting around 
1986, however, the neural network approach has achieved numerous successes 
and has grown in popularity as a scientific approach. It is sometimes referred 
to as "Connectionism" because it asserts that intelligence arises not in the 
manipulation of symbols but in the connections between nerves, or between 
computer components. 

We begin consideration of the chief AI techniques with symbolic process- 
ing—the top-down approach—then discuss its marriage with knowledge bases 
in what are often called expert systems. Neural networks (the bottom-up 
approach) come next. We end this section with a discussion of genetic 
algorithms and classifier systems, which are particular methods of great current 
interest, combining elements of the main approaches and drawing ideas from 
biology. Naturally, this very brief overview cannot do full justice to the 
complexity of this topic, and our aim is merely to provide a reasonably accurate 
picture of representative methods in each approach. 


Symbolic Processors 


Since the first conference on artificial intelligence, held at Dartmouth College 
in 1956, most AI workers have tended to define intelligence in terms of the 
manipulation of symbols and to write computer programs that could be called 
symbolic processors (Crevier 1993). While this approach may have limitations, 
it seemed the best way to study human language and problem-solving. In 
sociology, not only symbolic interactionists but also practically every re- 
searcher and theorist interested in norms, values, beliefs, institutions, and 
organizations gives great significance to shared meanings encoded and com- 
municated through symbols. Thus, symbolic processors seem the ideal AI 
technique for analyzing these phenomena, whether by means of abstract com- 
puter simulations or the empirical computerized analysis of language and 
meaningful behavior. 

At one level, all computer programs operate symbolically. That is, both 
numbers and words are symbols. Thus all input, output and the computer 
program itself are expressed in terms of symbols. For example, a symbolic 
processor cannot actually look at a set of child's blocks and physically arrange 
them to form a tower. However, the AI researcher can tell the program where 
the blocks are, define the concept tower for it, and then ask the program to 
say how to move the blocks into a tower. At another level, a key attribute of 
symbolic processors is that they do not reason numerically, by counting and 
analyzing equations, but symbolically. They might measure an emotion in 
terms of high or low, for instance, rather than measuring it as a ratio variable. 
In this sense, these AI researchers are intellectual cousins of symbolic inter- 
actionists. 

One way symbolic processors conceptualize intelligent behavior is in terms 
of problem spaces and production systems (Newell 1990). A problem space 
is a finite collection of states (situations, arrangements, etc) and operators 
(actions) that can be represented in the computer. These include the initial state 
and the desired state that is the goal. Operators can be used to move between 
states, to create new states, or to eliminate old states. Intelligent activity is 
characterized as the search through the problem space for the goal-state, much 
as one might wend one's way through a maze. There are many different search 
procedures, some of which are exhaustive, and some of which create interme- 
diate states on the fly. Problem spaces do not require that problems be well 
defined, but there must exist a goal state. 
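
As a rough illustration of search through a problem space, the following Python sketch runs an exhaustive breadth-first search from an initial state to a goal state, creating intermediate states on the fly. The toy problem (stacking three blocks into a tower), the state encoding, and the names `canon`, `moves`, and `search` are assumptions made for this example; they are not taken from any of the systems cited here.

```python
from collections import deque

def canon(stacks):
    """Canonical, hashable form of a state: a sorted tuple of stacks."""
    return tuple(sorted(tuple(s) for s in stacks))

def moves(state):
    """Operators: yield every state reachable by moving one top block."""
    stacks = [list(s) for s in state]
    for i, src in enumerate(stacks):
        block = src[-1]
        rest = [list(s) for j, s in enumerate(stacks) if j != i]
        remainder = src[:-1]
        base = rest + ([remainder] if remainder else [])
        # Operator 1: put the block on the table as a new stack.
        if remainder:
            yield canon(base + [[block]])
        # Operator 2: put the block on top of another stack.
        for k in range(len(rest)):
            new = [list(s) for s in base]
            new[k].append(block)
            yield canon(new)

def search(initial, goal):
    """Breadth-first search; returns the list of states from initial to goal."""
    frontier = deque([initial])
    parent = {initial: None}
    while frontier:
        state = frontier.popleft()
        if state == goal:
            path = []
            while state is not None:
                path.append(state)
                state = parent[state]
            return path[::-1]
        for nxt in moves(state):
            if nxt not in parent:
                parent[nxt] = state
                frontier.append(nxt)
    return None  # the goal state is not reachable

initial = canon([["A"], ["B"], ["C"]])   # three blocks on the table
goal = canon([["C", "B", "A"]])          # C on the table, B on C, A on B
path = search(initial, goal)
```

Here `path` holds the sequence of states visited on the way to the tower, so the program "says how to move the blocks" without physically moving anything.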

In a symbolic processing system, everything is done in terms of verbal 
productions (if-then rules), including description of states and search through 
the problem space. The set of productions in a system—or production sys- 
tem—is often referred to as its long-term memory. As it searches, the computer 
creates new productions which represent steps toward the goal. This search 
can occur in either a forward chaining fashion (from the initial state to the 
goal state) or a backward chaining fashion (from the goal state to the initial 
state). Symbolic processors vary in the conditions under which productions 
are applied, how extensive the match on the left-hand side of the if-then rule 
needs to be, whether these matches are allowed to be probabilistic, whether 
the rules can be applied in parallel, and so on. 
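
A minimal sketch of forward chaining over if-then productions may make this concrete. It assumes exact (non-probabilistic) matching of the whole left-hand side and applies rules one at a time; the rule contents and the function name `forward_chain` are hypothetical and chosen only for illustration.

```python
def forward_chain(facts, rules):
    """Fire if-then rules until no rule adds a new fact (forward chaining)."""
    facts = set(facts)
    changed = True
    while changed:
        changed = False
        for conditions, conclusion in rules:
            # A rule fires only when its entire left-hand side is matched.
            if set(conditions) <= facts and conclusion not in facts:
                facts.add(conclusion)
                changed = True
    return facts

# Each production is (conditions, conclusion): "if conditions then conclusion".
rules = [
    (("greets", "is_stranger"), "polite_exchange"),
    (("polite_exchange", "shared_interest"), "conversation"),
    (("conversation",), "acquaintance"),
]
derived = forward_chain({"greets", "is_stranger", "shared_interest"}, rules)
```

A backward-chaining variant would instead start from the goal ("acquaintance") and look for rules whose right-hand side produces it.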

In some symbolic processors, if there is not sufficient information to reach 
the goal state, or if some problem occurs during the search procedures, chunk- 
ing can be used to overcome these “impasses.” Chunking is a procedure that 
creates a new rule of the form, "if you are ever in situation X, do Y." This rule 
is a reduced encapsulation of a chain of reasoning (following a path of many 
rules across many states and often through many problem spaces) that was 
done in order to figure out what to do in situation X. Imagine going through 
a maze, making many wrong turns, and keeping a map. Conceivably the map 
of a particular maze might be reduced to a single rule, "if you ever encounter 
this maze again turn left at every juncture." This is a chunk. The chunk itself 
does not contain all of the rules that were followed, all the missteps that were 
made, in the original search. 

Search is one key element of symbolic systems. The other key component 
is knowledge representation. In symbolic systems knowledge is often repre- 
sented using some type of network of linked concepts (Carley 1986). Minsky 
(1975) refers to the smallest connected unit, two concepts and the relation 
between them, as a fact. A knowledge base can be thought of as a collection 
of facts or, equivalently, of rules. Sets of facts that are interrelated and serve 
to define a particular object or situation are often referred to as schemas or 
frames. State descriptions and problem space descriptions are often done by 
setting up and searching through particular schemas or frames. Symbolic 
processing systems vary widely in what search procedures and what knowledge 
representation schemes they use. 
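
One simple way to picture this representation, sketched here under our own assumptions, is to store each fact as a (concept, relation, concept) triple and to build a frame by collecting every fact that mentions a given concept. The example facts and the helper name `frame` are hypothetical.

```python
from collections import defaultdict

# Each fact links two concepts through a relation.
facts = [
    ("professor", "teaches", "student"),
    ("professor", "works_at", "university"),
    ("student", "enrolled_at", "university"),
]

def frame(concept, facts):
    """Collect every fact that mentions a concept into a frame-like dict."""
    slots = defaultdict(list)
    for subj, relation, obj in facts:
        if subj == concept:
            slots[relation].append(obj)
        elif obj == concept:
            # Record incoming links too, marked as inverse relations.
            slots["inverse:" + relation].append(subj)
    return dict(slots)

professor = frame("professor", facts)
university = frame("university", facts)
```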

A crucial test for the symbolic processing approach to artificial intelligence 
is its capacity to handle human language, and one test of a computer 
programmer's skill is the ability to write parsers. A parser is a set of rules tied to 
a dictionary, perhaps explicitly framed as a production system, that is designed 
to extract meaning from samples of language. It is relatively easy to write a 
computer program that will respond correctly to keyboard-typed commands 
like "GO UP," "GO DOWN," "PRINT 'YES,'" or "ADD 2 PLUS 2." Every 
high-level computing language (BASIC, Pascal, C, FORTRAN, etc) incorpo- 
rates a parser that translates human language into machine language com- 
mands. But the verbiage handled by most parsers is highly stylized, and the 
human must learn to stay within a fairly small set of linguistic conventions if 
the computer is to respond correctly. 

In the 1960s, computer workers expressed great optimism that they would 
soon create automatic systems for translating between languages, for example, 
taking Russian input and producing grammatically correct English output with 
the same meaning. Manifest failure came in the form of rapidly proliferating 
sets of rules and the recognition that words may have fluid and multifaceted 
meanings (Kelley & Stone 1975). Despite these problems, natural language 
processing by computers has steadily improved, stimulated both by advances 
in programming and by vastly more powerful computer hardware. 

A second challenge for symbolic processors was that if they were to be 
models of human cognition then they should solve problems as humans do 
and make the kinds of mistakes humans make. Research on language, as well 
as research on expert knowledge, led to the conclusion that human reasoning 
often employed vast quantities of task-specific information. This led some AI 
researchers to focus on the role of task knowledge in symbolic processing 
systems. This research grew into the area of expert systems and knowledge 
engineering. Many expert system programs have been written in LISP or 
PROLOG, languages developed for symbolic processing, and the rule struc- 
tures are quite comparable (Clark 1982, Cameron & Dixon 1992). 


Expert Systems 


Within sociology, expert systems may potentially revolutionize qualitative 
sociology the way computerized statistical packages have revolutionized quan- 
titative sociology (Brent 1986, Benfer et al 1991, Ohly 1993). Expert systems, 
for example, can be used to facilitate the coding of texts (Carley 1988), which 
may be as diverse as the words exchanged by members of a committee, the 
writings of a sociological theorist, or the public actions of international nego- 
tiators. Expert systems have found several applications in social welfare and 
human services (Schuerman et al 1989, Gingerich 1990, Mutschler 1990), 
assisting the professional in giving help, and the creation of their knowledge 
bases is practically equivalent to AI-assisted research on aspects of the 
profession and the social problem it addresses. 

In the editor's statement at the back of every recent issue, the International 
Journal of Expert Systems says its topic is "knowledge-based approaches to 
the construction of intelligent artifacts... A system is ‘knowledge-based’ when 
its behavior depends largely on information encoded in it or to which it has 
access, and is a[n] ‘expert-system’ when this knowledge would be considered 
expertise in a human." By these criteria, the spell-checker of a word processor 
could be an expert system. It has information that allows it to duplicate the 
expertise of a good human speller. However, one might want to reserve the 
term "expert system" for something a little smarter, that was able to respond 
in a complex way to different situations. While there is no clear line of 
demarcation, many would consider a good income tax package to be an expert 
system. 

An expert system can be thought of as a symbolic processor designed to 
solve some particular problem in a fashion similar to that of a human expert 
in that area. Current expert systems typically have three parts: an inference 
engine, a knowledge base, and a user-interface (Gonzalez & Dankel 1993). 

The inference engine is a symbolic processor that controls which rules are 
applied when and how. It contains general knowledge about how to conduct 
searches given the knowledge representation scheme being used within the 
expert system. The inference engine is often referred to as the shell. To create 
an expert system, researchers may purchase a shell and then add their task- 
specific knowledge base. 

The knowledge base is a collection of task-specific (or domain) knowledge. 
In most current expert systems, it contains some parts of this knowledge in 
the form of rules and others in the form of auxiliary relational or object-oriented 
data. 

Most current expert systems also have a separate user interface, the part of 
the package the user interacts with, which facilitates entering rules, generating 
reports, and managing a variety of ancillary tools. Producers of expert system 
shells have continually added enhancements in the user-interface area, includ- 
ing graphic ways of visualizing the structure of the knowledge, statistical 
analysis packages, and even the capacity for the system to learn the habits and 
priorities of the particular user. Greater use is being made of hypertext, the 
organization of textual material in a nonlinear manner, allowing the user to 
call up all kinds of information at any point in the process and to roam the 
knowledge base at will. 

Creation of an expert system by means of a pre-existing shell involves the 
work of both a knowledge engineer and one or more domain experts. The 
knowledge engineer is trained in the use of the shell, has experience in eliciting 
information from other people (usually through interviews), and has experience 
in converting this information to a set of rules. The domain expert is a person 
who is thoroughly familiar with the field of knowledge the expert system is 
being developed for and may know nothing about computers. For example, a 
knowledge engineer may be a computer scientist and a domain expert may be 
a neurosurgeon. 

To create an expert system intended to facilitate medical 
diagnosis in a particular category of diseases, the knowledge engineer will 
locate and interview a number of senior diagnosticians in that field, perhaps 
supplementing their interview responses with information from technical pub- 
lications and other sources. After such a system has been created, it will be 
distributed to medical personnel who lack expert knowledge of the particular 
diseases. When they encounter cases they have difficulty diagnosing, they will 
turn to the expert system, which typically will ask a number of questions about 
the case, then suggest a diagnosis. Thus, the expert system acts as an expert 
consultant. 

A chief challenge for expert systems is uncertainty. First, the domain of 
knowledge may be incomplete, probabilistic, problematic, or poorly organized. 
Second, the questions the user poses to the system may be sketchy and based 
on insufficient information about the case. These factors make it difficult to 
locate a deterministic solution. A well-designed system that lacks crucial 
information about a case will ask the user for it, but there are limits to how 
well additional information might resolve such ambiguities. Therefore, many 
expert system shells employ a variety of mathematical techniques to weight 
different information and provide estimates of their confidence in their conclusions, 
perhaps listing several possible conclusions with associated confidence scores. 
These weights can affect which rules are applied to the data in coming to a 
conclusion. Among these techniques are Bayesian probability measures, cer- 
tainty factors, and fuzzy logic (Gonzalez & Dankel 1993:232-62). 
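
To give a sense of how such weighting can work, the sketch below combines certainty factors using the combination rule commonly associated with MYCIN-style systems, then ranks candidate conclusions by their combined score. Choosing this particular rule is our assumption for illustration; the example evidence and the names `combine` and `rank` are hypothetical, and real shells differ in the schemes they use.

```python
def combine(cf1, cf2):
    """Combine two certainty factors, each in the range -1.0 to 1.0."""
    if cf1 >= 0 and cf2 >= 0:
        return cf1 + cf2 * (1 - cf1)      # two pieces of supporting evidence
    if cf1 < 0 and cf2 < 0:
        return cf1 + cf2 * (1 + cf1)      # two pieces of opposing evidence
    # Mixed evidence; undefined when one factor is +1 and the other is -1.
    return (cf1 + cf2) / (1 - min(abs(cf1), abs(cf2)))

def rank(evidence):
    """evidence: {conclusion: [cf, ...]} -> list sorted by combined confidence."""
    scores = {}
    for conclusion, cfs in evidence.items():
        total = 0.0
        for cf in cfs:
            total = combine(total, cf)
        scores[conclusion] = total
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

# Hypothetical evidence from fired rules for two candidate diagnoses.
evidence = {"flu": [0.6, 0.5], "cold": [0.7, -0.4]}
ranking = rank(evidence)
```

The ranked list corresponds to the "several possible conclusions with associated confidence scores" that a shell might present to the user.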

In practice, commercial expert systems have run into a number of difficul- 
ties, chief among them the problem of finding competent human experts, the 
great cost of all the human labor required to create a worthwhile system, the 
difficulty in articulating and systematizing knowledge in many domains, and 
the high cost of updating expert systems as their domains of knowledge 
develop. In addition, there are social, organizational, and legal constraints on 
the acceptance and use of expert systems. Entrenched professionals can feel 
threatened by them. Weaver (1986) predicted that medical expert systems 
might have the effect of regulating physicians' behavior, undermining their 
authority and prestige, and leading to a new division of labor among medical 
professions. This has not yet happened. The public, who might benefit from 
the presence of expert systems, often feel distrustful of machines and would 
rather talk to human experts. Finally, there is the issue of who bears legal and 
moral responsibility when the computerized system makes a costly mistake. 
For these and other reasons, the term "expert system" is often avoided in 
commercial applications, in favor of intelligent decision support systems, in- 
telligent advisory systems, consultant systems, and knowledge-based systems. 

Shangraw (1987) has identified a number of ways in which the approaches 
of social scientists and knowledge engineers differ. Sociologists are often 
concerned with the bases for judgments, and they know that people's views 
are powerfully shaped by social, cultural, and economic factors. Consequently, 
sociologists see experts as having "biased" knowledge. Social scientists seek 
to maximize the validity of judgment. In contrast, knowledge engineers are 
concerned with duplicating the judgments of the experts rather than criticizing 
or explaining them. For knowledge engineers, any bias in judgments is im- 
portant only if it results in too much inter-expert disagreement, and if this is 
the case, an expert system will not be built. If there is only a limited amount 
of disagreement, probabilistic weights on the various information will be used 
to reflect the level of inter-expert agreement. Knowledge engineers seek to 
replicate and systematize expert judgment. For the social scientist, the knowl- 
edge base itself is a source of data for further analysis. 


Neural Networks 


Artificial neural networks are computer systems involving software (and some- 
times hardware) that in some sense emulates the behavior of biological nervous 
systems (Rumelhart & McClelland 1987, Wasserman 1989, 1993, Karayiannis 
& Venetsanopoulos 1993). After a very short period of recent development, 
neural nets have accomplished many useful tasks. For example, it has been 
demonstrated that a sufficiently large and well-trained neural net can accurately 
model any continuous mathematical function whatsoever (Smith 1993). Al- 
ready, neural nets have been used to model economic data, competing suc- 
cessfully with more traditional techniques such as multiple regression, and the 
same approach can be applied directly to quantitative data collected by soci- 
ologists. As is true of symbolic processors, neural nets can model the behavior 
of intelligent actors in theory-based simulations, thus greatly expanding the 
tools available to sociological theorists. 

Neural nets are radically different from other approaches to artificial 
intelligence and quite unlike the statistical software familiar to sociologists. 
Information is stored in the connections between nerve-like structures, in a 
distributed fashion, so that a particular datum is spread across a number of 
memory registers that it shares with other data, rather than assigning each 
datum to a distinct address in the computer's memory. Conventional statis- 
tical software reads its data in from a disk in essentially the same form it 
stores that information in a memory array. Neural networks, in contrast, learn 
information in training sessions, much as a human might learn, and they 
store information in forms that bear no resemblance to the raw data that were 
presented to them. Symbolic processors can also learn during training ses- 
sions; the form of the information stored in them may or may not resemble 
the form of the raw data. 

A simple neural net might consist of fewer than a dozen nerve-like units, 
sometimes called neurons, nodes, or neurodes. There might be three layers of 
units. The first, or input layer, receives data. The third, or output layer, sends 
out the net's reactions to the data. Between them lies a layer of "hidden units" 
that are not directly connected to the outside world. Typically, every input unit 
is connected to every hidden unit, and every hidden unit is connected to every 
output unit. Associated with each connection is a connection strength or 
weight, a particular number that changes as the network learns to respond 
properly to a set of input data. 

The net is trained by presenting it with a series of cases, each of which 
consists of an input vector and an output vector. The input vector is a set of 
numbers applied one each to the input units. Then the net produces an output, 
by using the connection weights to transform the input. The training algorithm 
compares the net's actual output with the desired output vector, and it follows 
a complex set of procedures to propagate the error back through the network 
(back propagation), adjusting all appropriate connection weights so that in 
future the net should respond with less error. In fact, computer scientists have 
experimented with many different network structures and training algorithms, 
but the basic idea is constant. A network of nerve-like units learns to respond 
in a desired way to a training set of data and then is ready to analyze fresh 
data of the same kind. 
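
The training procedure can be sketched in a few dozen lines of Python. The example below is only an illustration under stated assumptions: a three-layer net with sigmoid units, a squared-error measure, weights updated after every case, and a toy training set (the logical OR of two inputs). The class name `TinyNet`, the learning rate, and the number of passes are all hypothetical choices, not a reconstruction of any cited system.

```python
import math
import random

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

class TinyNet:
    def __init__(self, n_in, n_hidden, n_out, seed=0):
        rng = random.Random(seed)
        # One row of connection weights per receiving unit; last entry is a bias.
        self.w_ih = [[rng.uniform(-1, 1) for _ in range(n_in + 1)]
                     for _ in range(n_hidden)]
        self.w_ho = [[rng.uniform(-1, 1) for _ in range(n_hidden + 1)]
                     for _ in range(n_out)]

    def forward(self, x):
        """Transform the input vector into an output using the weights."""
        xb = list(x) + [1.0]
        hidden = [sigmoid(sum(w * v for w, v in zip(row, xb))) for row in self.w_ih]
        hb = hidden + [1.0]
        output = [sigmoid(sum(w * v for w, v in zip(row, hb))) for row in self.w_ho]
        return xb, hb, output

    def train_case(self, x, target, rate=0.5):
        """Present one case and propagate its error back through the net."""
        xb, hb, out = self.forward(x)
        # Error signal at each output unit (actual minus desired output).
        d_out = [(o - t) * o * (1 - o) for o, t in zip(out, target)]
        # Error signal at each hidden unit, passed back through the weights.
        d_hid = [hb[j] * (1 - hb[j]) *
                 sum(d_out[k] * self.w_ho[k][j] for k in range(len(out)))
                 for j in range(len(hb) - 1)]
        # Adjust every connection weight to reduce future error.
        for k, row in enumerate(self.w_ho):
            for j in range(len(row)):
                row[j] -= rate * d_out[k] * hb[j]
        for j, row in enumerate(self.w_ih):
            for i in range(len(row)):
                row[i] -= rate * d_hid[j] * xb[i]

    def error(self, cases):
        """Total squared error over a set of (input, desired output) cases."""
        return sum((o - t) ** 2 for x, target in cases
                   for o, t in zip(self.forward(x)[2], target))

# Training set: each case pairs an input vector with a desired output vector.
cases = [((0, 0), (0,)), ((0, 1), (1,)), ((1, 0), (1,)), ((1, 1), (1,))]
net = TinyNet(2, 2, 1)
before = net.error(cases)
for _ in range(2000):
    for x, target in cases:
        net.train_case(x, target)
after = net.error(cases)
```

Comparing `before` with `after` shows the net's error on the training set falling as the connection weights are adjusted.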

Neural nets are ideally suited for parallel processing, which is a significant 
revolution in computer technology. Traditional electronic computers consisted 
essentially of a bank of memory registers and a single central processing unit 
(CPU). The CPU would be in charge of everything, and all data and program- 
ming commands would pass through it. Thus, the speed of a computer was 
determined by the speed of this CPU, and everything depended upon the 
efficiency and reliability of a single electronic component. Parallel processing, 
as the name implies, employs a number of processors, perhaps many thousands 
of them, operating simultaneously. If problems of coordinating the actions of 
these processors can be solved, the result is much faster operation, perhaps by 
several orders of magnitude. Parallel techniques are also important to symbolic 
processors. In symbolic processors, being able to apply multiple rules in 
parallel also reduces operation time. 

One justification for parallel distributed processing and neural nets is that 
the human brain must use something very similar. However, severe doubts 
have been raised whether existing neural nets properly simulate biological 
nervous systems, in particular whether biological nets make use of anything 
like back propagation (Hinton 1992). Another justification is simply speed. 
Doing things in parallel allows far more complex problems to be solved in 
reasonable time. 


Genetic Algorithms and Classifier Systems 


Genetic algorithms are an evolutionary approach to the solution of problems 
with a focus on a population of entities that become progressively better 
adapted to the problem space. Each entity is a string of symbols that can be 
read by a classifier system as a set of rules. As time goes by, strings that handle 
the problems better than others become more numerous in the population, share 
features of the solution with each other, and develop new features, in processes 
analogous to biological selection, reproduction, and mutation. While it is clear 
that this approach can perform some kinds of statistical analysis on empirical 
data, it has been imported so recently into the social sciences that technical 
and theoretical issues have been given the greatest attention. 

Of special interest for ASI is that the inherent parallelism in the 
genetic-based search procedure puts the intelligence of the procedure at the 
level of the population. In these systems, it is as though individuals independently and 
in parallel examine regions of the search space and can never "know" but a 
small piece of the overall solution, yet the system itself achieves effective 
search and so makes intelligent decisions. Real human societies possess greater 
information and capacity to process data than does any given individual mem- 
ber. The population of strings manipulated by a genetic algorithm is analogous 
to the gene pool of a human population, except that it is possibly cultural rather 
than biological in nature. Thus, despite the nonsociological origin of its 
metaphors, the genetic algorithm approach reinforces the sociological principle 
that real intelligence is in essence social. 

The concept of a genetic algorithm arises in Holland's (1975, 1992) general 
treatise on adaptation in natural and artificial systems. As formulated by 
Holland (1975:20), a general adaptive problem has three components: (i) a 
system with an adaptive plan that determines successive changes in structure 
in response to an environment, (ii) the environment of this system, and (iii) a 
measure of the performance of different structures in the environment. In each 
application, the structures that undergo adaptation must be identified and 
represented, the mechanisms by which structures change must be specified, 
and the performance criterion affecting a structure's chances of persisting over 
time must be identified. The language of biology is useful to describe and label 
these three components. Structures are sets of chromosomes. The mechanisms 
that change structures are analogous to biological mechanisms of crossover, 
inversion (reproduction), and mutation, and the performance evaluation rep- 
resents the fitness of the structure in the environment. 

For convenience the process is viewed as a discrete time process, with the 
system following a trajectory described by the changing probability distribu- 
tion over attainable structures. At each point in time, the existing structures 
are evaluated against the environment in relation to performance criteria. The 
system's adaptive plan then takes these evaluations and the existing population 
of structures and produces the next generation’s population of structures. Good 
adaptive plans increase the average performance of structures in the environ- 
ment. Viewed as a search procedure, the genetic algorithm produces, over 
time, a concentration of structures in regions of the problem space that have 
relatively high fitness values. 
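
The generational cycle just described can be sketched as follows. This is a minimal illustration, not Holland's formulation: it assumes binary strings, fitness-proportional selection of parents, one-point crossover, a small mutation rate, and elitism (the best structure is always carried forward), and it uses the number of 1s in a string as a toy performance measure. The function name `evolve` and all parameter values are hypothetical.

```python
import random

def evolve(fitness, length=20, pop_size=30, generations=40, p_mut=0.02, seed=0):
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(length)] for _ in range(pop_size)]
    history = []  # best fitness found in each generation
    for _ in range(generations):
        # Evaluate the existing structures against the performance criterion.
        scored = sorted(pop, key=fitness, reverse=True)
        history.append(fitness(scored[0]))
        weights = [fitness(s) + 1e-9 for s in scored]  # fitter strings reproduce more
        nxt = [scored[0][:]]  # elitism: keep the best structure (an assumption)
        while len(nxt) < pop_size:
            mom, dad = rng.choices(scored, weights=weights, k=2)
            cut = rng.randrange(1, length)             # one-point crossover
            child = mom[:cut] + dad[cut:]
            # Mutation: flip each symbol with small probability.
            child = [1 - g if rng.random() < p_mut else g for g in child]
            nxt.append(child)
        pop = nxt  # the next generation's population of structures
    best = max(pop, key=fitness)
    history.append(fitness(best))
    return best, history

best, history = evolve(sum)  # toy fitness: count of 1s in the string
```

Because the best string is never discarded, the values in `history` cannot decrease from one generation to the next.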

The second fundamental idea, a classifier system, receives exposition in 
some recent work by Holland et al (1986), and is similar to a production system. 
Recall that a production is an if-then rule. In symbolic systems a production 
is spelled out using words. For Holland and his associates, a classifier is 
represented as two strings of numbers. Conditions and actions are abstractly 
represented by strings of symbols from the three-letter alphabet "0", "1", and 
"#". In conditions, "#" is called the "don't care," meaning that either a "0" 
or a "1" can occur in a message satisfying the condition; for instance, "0#" 
interpreted as a condition is satisfied by either of two messages, "00" and "01". 
In actions, "#" is interpreted as a placeholder passing along to the output 
message the corresponding "0" or "1" in the input message; for instance, if 
"11" satisfies the (first) condition of a classifier having "0#" as its action part, 
then the classifier will post the output message "01". 
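
The matching and pass-through conventions can be checked with a short Python sketch. The function names `matches` and `post` are hypothetical, and the sketch assumes that conditions, actions, and messages all have the same length.

```python
def matches(condition, message):
    """True if the message satisfies the condition; '#' means "don't care"."""
    return len(condition) == len(message) and all(
        c == "#" or c == m for c, m in zip(condition, message))

def post(action, message):
    """Build the output message; '#' in the action passes the input symbol through."""
    return "".join(m if a == "#" else a for a, m in zip(action, message))

satisfied = [m for m in ("00", "01", "10", "11") if matches("0#", m)]
output = post("0#", "11")
```

Here `satisfied` contains exactly the two messages that satisfy the condition "0#", and `output` is the message posted when the action "0#" is applied to the input message "11".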

A classifier K then has the form C1,C2,...,Cr/A where C1,...,Cr designate 
its r conditions and A designates its action. A classifier system consists of n 
classifiers denoted K1,K2,...,Kn, a message list, an input interface and an 
output interface. Execution proceeds by first placing all messages from the 
input interface on the message list. The message list is then processed by the 
classifiers for matches to their conditions. Classifiers whose conditions are 
matched then post their messages to a new message list which replaces the old 
list. The list is then processed through the output interface to produce the 
system’s overall activity. Control then returns to the first step and execution 
continues. 
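
The execution cycle might be sketched as below. This is a simplified reading under our own assumptions: every matched classifier posts its message (there is no competition among classifiers), pass-through symbols are taken from the first message that satisfies the first condition, and the input interface is represented by a list of message lists, one per cycle. The names `satisfied_by`, `step`, and `run` are hypothetical.

```python
def satisfied_by(condition, message):
    """A condition symbol matches if it is '#' or equals the message symbol."""
    return all(c in ("#", m) for c, m in zip(condition, message))

def step(classifiers, message_list):
    """Match every classifier against the list and return the new list."""
    new_list = []
    for conditions, action in classifiers:
        # Each condition must be satisfied by some message on the current list.
        hits = [next((m for m in message_list if satisfied_by(c, m)), None)
                for c in conditions]
        if all(h is not None for h in hits):
            first = hits[0]  # '#' in the action copies symbols from this message
            out = "".join(m if a == "#" else a for a, m in zip(action, first))
            if out not in new_list:
                new_list.append(out)
    return new_list

def run(classifiers, input_stream):
    message_list, outputs = [], []
    for incoming in input_stream:
        message_list = message_list + list(incoming)    # place input messages
        message_list = step(classifiers, message_list)  # new list replaces old
        outputs.append(list(message_list))              # pass to output interface
    return outputs

# Two classifiers in the form (conditions, action).
classifiers = [(["0#"], "1#"), (["1#", "#1"], "00")]
outputs = run(classifiers, [["01"], []])
```

In this run the first classifier responds to the input message in the first cycle, and the message it posts triggers the second classifier in the next cycle, which illustrates how classifiers can chain through the message list.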

Associated with each classifier in the system is a quantity called its strength. 
This quantity has three functions. First, among the classifiers whose condition 
sets are matched and therefore offer competing alternative responses to the 
current situation, relative strength determines which ones win the competition 
to post their messages. Second, the strength of a classifier serves as a measure 
of its usefulness to the system because strength is adjusted based on system 
performance. The specific mechanism that adjusts strength is called the bucket 
brigade algorithm. Third, the application of the genetic algorithm to the gen- 
eration of new classifiers uses classifier strength to choose “parent” rules for 
the next generation. 

Because of the formal simplicity of their construction, classifier systems are 
natural candidates for evolution by means of a genetic algorithm. Strings 
specifying conditions and/or actions can be split and recombined in the cross- 
over process to produce new and perhaps better classifiers. There are several 
possible specifications of the conditions under which the genetic algorithm is 
invoked. It could be invoked at the end of some well-defined performance 
cycle as in Goldberg's (1983) system to induce expert knowledge with respect 
to the regulation of gas-pipeline transmission. Holland et al (1986) suggest as 
general triggering conditions the failure of predictions and the occurrence of 
unusual events. 


APPLICATIONS OF ASI TO THEORY 


The growing number of theoretical essays grounded in computer simulations 
is one indication that social theorists are seeking ways to render their work 
more rigorous (Hanneman 1988). We suggest that the development of com- 
puter-based artificial social intelligence can have as great a positive impact on 
theory as did computerized statistical analysis on quantitative empirical re- 
search. Properly designed ASI programs can assess the logical consistency and 
completeness of theories, help discover new implications of old ideas, and 
connect scattered hypotheses into coherent theoretical systems. ASI may in- 
spire altogether new theories, increase our appreciation of classical theories, 
and help improve and evaluate still-developing theories of social interaction 
and social structure. 


ASI-Inspired Social Theories 


Fararo & Skvoretz (1984, Skvoretz & Fararo 1989) have argued that the AI 
concept of production system can form the basis of a general theory of social 
institutions and action structures. Since a production is a rule in which a set 
of conditions demands a particular action, social norms are productions. Insti- 
tutions and roles (distinctive sets of norms linked into cultural structures) are 
therefore production systems. Fararo & Skvoretz show how a social interaction 
is organized by the system of productions that defines the interrelationships 
of the roles being played, what they call the rolegram. Their work draws ideas 
from traditional writers, such as Talcott Parsons, and incorporates many con- 
ventional sociological concepts, but it places them in dynamic systems that 
owe much to ASI, even if they need not be realized on a computer. 

In similar fashion, Carley (1989, 1991b) has developed the constructural 
theory of social change predicated on individual adaptation at the cognitive 
level. Each individual has a knowledge base, and culture is the distribution of 
knowledge across individuals. Carley shows how social change, and processes 
such as diffusion, are controlled by the distribution of knowledge and by 
actions taken by individuals on the basis of this knowledge. Her work draws 
on traditional writers such as Homans and Durkheim and on the area of social 
network analysis, but it places this work in a dynamic evolving system. Kaufer 
& Carley (1993) use the constructural theory to examine the social repercus- 
sions of print technology, arguing that communication technologies generate 
artifacts (such as books) that can be effectively modeled as artificial social 
agents with either more or less cognitive capabilities relative to humans. Using 
this approach they show the potential of print to change the nature of both the 
professions and academe. 

Some have drawn lessons for theory from the apparent successes of com- 
puter intelligence, notably Slezak (1989) who argues that the "strong pro- 
gramme" in the sociology of science must be wrong because symbolic pro- 
cessor AI programs have successfully derived scientific and mathematical laws 
apparently without being influenced by sociocultural factors. Writing in the 
American Journal of Sociology, Wolfe (1991) draws the opposite lesson from 
his readings about AI, deciding that humans have a distinctive form of mind 
that cannot be duplicated by either symbolic processors or neural networks, 
a conclusion that would support interpretive rather than formal systematic 
schools of sociological analysis. Kontopoulos (1993) suggests that neural 
networks offer an appropriate metaphor for understanding social structure, thus 
incorporating insights from ASI into a general theory that need not be ex- 
pressed in computational terms. 


Expert System Models of Human Theorists 


Rule-based expert systems are a very promising tool for theory formalization, 
and they may be used to analyze the thought of particular theorists. Some 
traditional sociologists, notably George Homans and Peter Blau, intended their 
theories to be what today we might call production systems, with each axiom 
or theorem represented by a production with linkages to others. But one way 
to study the thought of any social theorist would be to attempt to state his or 
her arguments in terms of productions. This difficult task might be facilitated 
by use of a flexible and full-featured expert system shell, taking the role of 
the knowledge engineer interrogating the writings of the theorist as if they 
were domain experts. 

The Erving programs (Brent et al 1989) are an expert system that simulates 
Erving Goffman's dramaturgical perspective. Designed as a teaching tool, 
Erving takes the software's user into a "front-stage" bar, with associated 
"back-stage" party room and pool room, watching men and women interact 
and predicting their "impression management" behavior according to Goff- 
man's principles. The user assembles various questions, piece by piece, such 
as: "How would Diane feel if Dave were to lie about age in the bar?" "Would 
it be disruptive for Dave to make eye contact with members of opposite sex 
in the pool room?" The computer can test the user's understanding of Goff- 
man's theory and offer explanations to the answers it gives for any question. 

Banerjee (1986) wrote production systems in the PROLOG language to 
simulate sociopolitical theories of Skocpol and O'Donnell. The actors in each 
system are self-aware social groups with well-developed theories of the inter- 
ests and possible coalitions in the worlds they inhabit. Skocpol's analysis of 
China in the decade following 1927, for example, posits the following seven 
actors: settled peasants, displaced peasants, Communist Party, Kuomintang 
Party, gentry, coopted warlords, and independent warlords. In both cases, 
Banerjee found the predicted result, indicating that the two theories are 
logically constructed; when any key assumption was removed, very different 
results emerged. 


Simulations of Markets and the Iterated Prisoner's Dilemma 


Computer simulation has a long history in the social sciences (Federico & 
Figliozzi 1981, Garson 1987), and mathematical models of human learning 
suitable for use in ASI programs were available decades ago (Bush & Mosteller 
1955). However, most sociological computer simulations lacked explicit rep- 
resentations of human intelligence until recently. Today, many studies reported 
in central sociological journals employ computer models of human learning, 
decision-making, and social exchange, but they seldom mention any connec- 
tion to artificial intelligence, even though retrospectively we can identify them 
as ASI. At their borders, math models and artificial intelligence blend into 
each other, and neural networks are an especially convenient way of embod- 
ying math models in computer programs (Wasserman 1993). 

The complexity of social interaction has prompted the increasing use of 
computer simulation in place of formal mathematical models. The now-classic 
"prisoner's dilemma" computer tournament organized by Robert Axelrod may 
have been the turning point (Axelrod 1984). The prisoner's dilemma is a 
game-theoretic problem that explores the conditions under which cooperation 
may arise between self-interested economic actors who might gain in the short 
run if they violated agreements to cooperate (Rapoport & Chammah 1965). 
Axelrod invited people to submit computer programs that followed various 
strategies for playing repeated rounds of this game (the iterated prisoner's 
dilemma or IPD), and his tournament showed that simulation can produce 
robust yet unexpected results. The winner was one of the simplest contestants, 
a strategy of “tit-for-tat” that merely cooperated in the first move and thereafter 
imitated the previous action of its interactant. The simulation results showed 
how a simple norm of reciprocity could gain a toehold even in a harshly asocial 
environment and then go on to flourish. This strategy was able to displace 
much more cognitively sophisticated contestants. In short, the intelligence 
needed to find a way out of the social trap is not always isomorphic with the 
cognitive and analytic faculties of the organisms. A key contribution of ASI 
is the recognition that problem solving can sometimes depend more on what 
goes on between organisms than on what goes on within them. 
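
The logic of such a tournament can be sketched in a few lines of Python. The payoff values below (5, 3, 1, 0) are the conventional prisoner's dilemma values and are an assumption here, as are the function names and the two comparison strategies; this is an illustration, not a reconstruction of the actual tournament entries.

```python
# Iterated prisoner's dilemma: a minimal sketch.
# "C" = cooperate, "D" = defect. Payoffs are (first player, second player)
# and use conventional values, assumed for illustration.
PAYOFF = {
    ("C", "C"): (3, 3),
    ("C", "D"): (0, 5),
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),
}


def tit_for_tat(own_history, other_history):
    """Cooperate first, then copy the partner's previous move."""
    return "C" if not other_history else other_history[-1]


def always_defect(own_history, other_history):
    return "D"


def always_cooperate(own_history, other_history):
    return "C"


def play_match(strategy_a, strategy_b, rounds=10):
    """Play repeated rounds and return the two total scores."""
    hist_a, hist_b = [], []
    score_a = score_b = 0
    for _ in range(rounds):
        move_a = strategy_a(hist_a, hist_b)
        move_b = strategy_b(hist_b, hist_a)
        pay_a, pay_b = PAYOFF[(move_a, move_b)]
        score_a += pay_a
        score_b += pay_b
        hist_a.append(move_a)
        hist_b.append(move_b)
    return score_a, score_b
```

In this sketch tit-for-tat never outscores its partner within a single match: over ten rounds it loses 9 to 14 against `always_defect`. It nevertheless earns the full cooperative payoff of 30 when paired with another reciprocator, which is why its total across a mixed field can be high.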

Subsequently, other researchers have held quite complex tournaments, such 
as one staged by Rust et al (1993) that was a double-auction market, like the 
one actually conducted in commodities and options by the Chicago Board of 
Trade. Its winning entries tended to be simple production systems with rela- 
tively little intelligence, but one competitor was a mammoth neural net with 
fully 1262 connection strength and bias parameters that learned by means of 
genetic algorithms. For several years, the IPD strategy that drew the most 
attention from researchers and theorists was tit-for-tat, but a rival called 
“Pavlov” has recently seized center stage (Nowak & Sigmund 1993). This 
strategy has the individual actor continue to behave in a given way (keeping 
bargains or violating them) so long as it wins, and to shift behavior as soon 
as it loses. Because both tit-for-tat and Pavlov have the simulated person pay 
attention to what happened in the previous exchange, they connect directly to 
sociological theories of social learning. 
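
The Pavlov rule is easy to state in code. The sketch below assumes the usual reading of "winning": the actor counts the previous round as a win whenever the partner cooperated, and as a loss whenever the partner defected. The helper `moves_against` is a hypothetical convenience for inspecting the resulting sequence of moves.

```python
# "Pavlov" (win-stay, lose-shift): a minimal sketch.
# Assumption: a round counts as a win if the partner cooperated.

def pavlov(own_history, other_history):
    """Repeat the last move after a win; switch moves after a loss."""
    if not own_history:
        return "C"  # opening move, assumed cooperative
    last_own, last_other = own_history[-1], other_history[-1]
    if last_other == "C":
        return last_own                      # win: stay
    return "D" if last_own == "C" else "C"   # loss: shift


def moves_against(strategy, opponent_moves):
    """Return the moves a strategy makes against a fixed opponent sequence."""
    own, other = [], []
    for opponent_move in opponent_moves:
        own.append(strategy(own, other))
        other.append(opponent_move)
    return own
```

Against a partner who always defects, this rule alternates between cooperation and defection, since every round is a loss. Against a steady cooperator it keeps doing whatever it did last, including defecting if it happens to have started doing so.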

In a series of theoretical papers based on ASI simulations, Macy (1990, 
1991a,b) has developed stochastic learning models for the IPD that show how 
it is possible to “walk” out of social traps. The prisoner's dilemma is a trap, 
because the contingencies encourage people to act in ways that are not in 
accord with their own long-term interests. Real human life may be filled with 
social traps, in which decisions that make sense to each individual aggregate 
into outcomes that make no sense for all. In a typical run of this series, each 
individual's probability of cooperating depends upon past experience. The 
interacting population may be able to escape from a noncooperative equilibrium 
if a sequence of random events (the proverbial "drunkard's walk") brings 
them near enough to a cooperative equilibrium for them to settle on it. This 
research is a critique of rational choice theory, showing how learning theory 
can solve some of its problems. And like Axelrod's work it demonstrates that 
social actors may be able to escape the Hobbesian state of nature without the 
help of a king, a shared set of altruistic values, or even the degree of intelligence 
required to understand their situation fully. 
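
A stochastic learning model of this general kind can be sketched as follows. The update rule is a simple reinforcement scheme in the Bush and Mosteller tradition; the payoff values, aspiration level, learning rate, and function names are all assumptions chosen for illustration, and the code is not a reconstruction of any published model.

```python
import random

# Payoff to the first actor given (own move, partner's move); assumed values.
PAYOFF = {("C", "C"): 3, ("C", "D"): 0, ("D", "C"): 5, ("D", "D"): 1}


def update(p_coop, action, payoff, aspiration=2.0, rate=0.2):
    """Reinforce the action just taken if it paid above aspiration, else inhibit it."""
    rewarded = payoff > aspiration
    # Probability of cooperating rises if cooperation was rewarded
    # or if defection was punished; otherwise it falls.
    toward_cooperation = (action == "C") == rewarded
    if toward_cooperation:
        return p_coop + rate * (1.0 - p_coop)
    return p_coop - rate * p_coop


def simulate(rounds=200, p_start=0.1, seed=0):
    """Two learners interact repeatedly; return their final cooperation probabilities."""
    rng = random.Random(seed)
    p_a = p_b = p_start
    for _ in range(rounds):
        a = "C" if rng.random() < p_a else "D"
        b = "C" if rng.random() < p_b else "D"
        p_a, p_b = (update(p_a, a, PAYOFF[(a, b)]),
                    update(p_b, b, PAYOFF[(b, a)]))
    return p_a, p_b
```

Under these assumed settings, mutual defection pays below aspiration and so weakens itself, while mutual cooperation pays above aspiration and so strengthens itself. Whether a given run settles on cooperation depends on the random sequence of moves, which is the sense in which the actors may "walk" out of the trap.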

An amazing variety of excellent work has been based on computer simulations 
of similar exchange systems, or on experiments with humans that might 
be further elucidated via simulations. Kollock (1993) has examined the effect 
of random errors and mistaken perceptions on the relative effectiveness of 
strategies like tit-for-tat. Orbell & Dawes (1991) explored the evolution of a 
cooperator's advantage when actors were allowed to withdraw from interac- 
tion. Frank (1988, 1993) also allowed exit from the game in his theoretical 
explorations of rational processes that led to the evolution of displays of 
emotion, thereby unleashing much simulation work by other researchers. Vila 
& Cohen (1993) modelled exchanges among individuals who could adopt 
either of two strategies, producing wealth or expropriating it, thus exploring 
a theory of theft based upon earlier work in behavioral population biology. 
This last study suggests that ASI may have a considerable impact on crimi- 
nology and the sociology of deviance. 


Simulations of Networks, Groups, and Organizations 


Many studies have examined social structures by means of ASI simulations, 
usually without explicitly acknowledging that artificial intelligence was in- 
volved. A good example for those who want to learn about ASI in connection 
with social networks is an article by Markovsky (1987, cf Markovsky 1995), 
because it focuses on the elemental social structure, the triad, and because it 
includes the actual source code of the program that was used. This study 
explored the power associated with position in a three-person social network 
where person A could interact with persons B and C, who however could not 
interact with each other. In a round of a typical experiment in the series, each 
person makes an offer of how 24 points could be divided between himself and 
another person. Person A compares the offers of B and C, selects one of them, 
and then the points are divided according to the average of the two persons' 
offers. After the first round (when the offers are randomly determined), each 
person adjusts his offer on the basis of what happened last time: if the previous 
offer was accepted, the new offer will be more demanding; if the previous 
offer was rejected, the new offer will be less demanding. On this simple basis, 
Markovsky was able to build a series of nineteen experiments that varied the 
strategies employed by individuals bargaining with each other. While rudi- 
mentary, the decision-making by actors, and their memories of the result of 
previous exchanges, constitute ASI. Similar work exploring the implications 
of structure in slightly larger networks has been undertaken by several re- 
searchers (Yamagishi et al 1988, Markovsky et al 1993). 
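
The triad experiment described above can be approximated in a short Python sketch. This is one possible reading of the verbal description, not the published source code: the starting demands are passed in rather than drawn at random, demands move by one point per round, and the names `run_triad` and `clamp` are hypothetical.

```python
POOL = 24  # points to be divided in each exchange, as in the text


def clamp(demand):
    """Keep a demand between 1 and POOL - 1 points."""
    return max(1, min(POOL - 1, demand))


def run_triad(demands, rounds=20, step=1):
    """Simulate A bargaining with B and C, who cannot deal with each other.

    demands maps "A", "B", "C" to the points each claims for itself.
    Returns a list of (partner chosen by A, A's share) for each round.
    """
    demands = dict(demands)
    history = []
    for _ in range(rounds):
        # A accepts whichever peripheral actor claims less, i.e. offers A more.
        partner = min(("B", "C"), key=lambda name: demands[name])
        excluded = "C" if partner == "B" else "B"
        # Divide the pool by averaging A's claim with what the partner concedes.
        a_share = (demands["A"] + (POOL - demands[partner])) / 2
        history.append((partner, a_share))
        demands["A"] = clamp(demands["A"] + step)            # A always gets a deal
        demands[partner] = clamp(demands[partner] + step)    # accepted: demand more
        demands[excluded] = clamp(demands[excluded] - step)  # rejected: demand less
    return history
```

Because A is included in every exchange while B and C take turns being excluded, A's demand ratchets upward and the peripheral demands hover around their starting level. Under these assumptions A's share therefore grows over the rounds, a simple expression of the power attached to the central position.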

Other examples cover a wide sociological territory. Feinberg & Johnson 
(1988, 1990) simulated the effect of an outside agitator on crowds, moving 
individuals physically toward the center of a mob and moving them mentally 
toward the agitator's preferred action. The individuals differed initially in terms 
of suggestibility and the propensity to move as well as in physical location 
and action choice. McPhail et al (1992, McPhail & Tucker 1990) modelled 
the physical movements of individuals as they threaded their way through 
crowds to reach a destination while remaining with each other in collective 
locomotion. Hummon (1990) simulated bureaucrats accepting, rejecting, and 
referring tasks on the basis of their growing experience with different kinds 
of work, thus creating the division of labor in a network. Anderson (1991) 
modelled social influence on voting behavior in small groups of union mem- 
bers. Bainbridge (1987, 1995) employed neural networks to simulate actors 
with the intelligence to develop schemes for categorizing exchange partners 
and capable of learning which categories are most rewarding. 

There is a long tradition in the use of simulation in organizational theory 
(Cyert & March 1963, Cohen et al 1972, Harrison & Carroll 1991). Recently, 
however, these simulations have included the use of ASI concepts (Masuch & 
LaPotin 1989, Carley et al 1992). As a field, organization theory has begun to 
recognize that theories of organizations need to take into account the complex 
interdependencies among people, machines, tasks, and society, the adaptability 
of both people and organizations, and the processes through which organiza- 
tional behavior emerges (Masuch & Warglien 1992, Carley & Prietula 1994). 

Traditional techniques for theory development are not sufficient for dealing 
with the difficulties inherent in theorizing about organizations as complex 
adaptive phenomena. Expert systems that can embody the theories and help 
the researcher apply the theories systematically are making it possible to test 
cogently the full extent of complex theories (Baligh, Burton & Obel 1990, 
1992). Computational models, particularly those in which the agents in the 
organization are modeled as complex, adaptive, and intelligent, are being used 
to examine emergent organizational behavior and to develop theory. Illustrative 
models include Masuch & LaPotin (1989), Cohen (1986), Carley (1991a, 1992), 
Harrison & Carroll (1991), and Lin & Carley (forthcoming). An area of related 
interest is distributed artificial intelligence (Bond & Gasser 1988). 

Recently, organizational models have been built out of collections of Soar 
agents. Soar is a production system language with built-in procedures for 
chunking (Laird et al 1987). It has been viewed as a unified theory of cognition 
(Newell 1990, Carley & Wendt 1991), because it embodies many of the 
findings on human cognition that have appeared in cognitive science. On 
numerous tasks it exhibits behavior similar to that of individual humans, and 
the recent organizational work asks whether in fact Soar can also act like 
humans in a social context. The first such study examined an organization of 
agents engaged in a warehouse task (Carley et al 1992). In this case, each Soar 
agent was run on a different machine, and the organization was formed out of 
a network of computers. If this vivid metaphor is not considered important, 
such simulations can be run on a supercomputer, simply allocating different 
sectors of memory to separate individuals. 

Some traditional sociologists might complain that computer simulations 
inappropriately reduce social interaction to predictable, mechanistic cartoons 
that fail to capture the complexity and indeterminacy of human affairs. There 
are two refutations to this uninformed stereotype. First, even simple models 
often have great predictive power, and simulation results can and do often 
match empirical data both qualitatively (Carley 1991, Kaufer & Carley 1993) 
and quantitatively (Carley 1990, Carley & Lin, forthcoming). 

Second, computational modeling can often generate surprising results. 
Markovsky (1992) found that even very simple models of interaction across 
networks took the researcher beyond the limits of predictability. Recently, 
there has been great interest in the role of chance in several of the sciences, 
and the concept of deterministic chaos has been the subject of many publica- 
tions of both scholarly and popular kinds (Mandelbrot 1983, Hao 1984, Gleick 
1987). Kephart et al (1992) have noted that social behavior can become chaotic, 
and they ran simulations that showed how some intelligent strategies can 
reduce chaos, in particular giving actors the capacity to base their actions both 
on beliefs about the strategies of others and on the observed behavior of the 
system of agents. 

Simulation work employing genetic algorithms has only just begun. Free- 
man (1993) used one to solve an old problem in social network analysis, 
namely, the partitioning of members of a group into cliques or subgroups based 
on members' proximities to one another. The algorithm processes assignments 
of individuals to subgroups searching for an assignment that maximizes a 
fitness function that is sensitive to misclassification of individuals and to 
average proximities. Axelrod (1987) has switched to genetic algorithms to 
evolve strategies in the Iterated Prisoner's Dilemma. Skvoretz & Fararo (1994) 
conceptualize social exchange in a game-theoretic context and apply a genetic 
algorithm to the evolution of mutual aid strategies. The game is similar to the 
prisoner's dilemma and the analysis is similar to Axelrod's. However, they 
introduce rudimentary role-differentiation into the problem, as strategies can 
function either as assistance requesters or providers of help, and they contrast 
the genetic-algorithm findings with those from a learning model implementa- 
tion of the problem. 
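
To illustrate how a genetic algorithm can search for a clique partition, the sketch below evolves assignments of six hypothetical individuals to two subgroups. The fitness function, the signed proximity scores (positive for close pairs, negative for distant pairs), and all parameter values are assumptions made for this example; they are not the fitness function or settings used in any of the studies cited.

```python
import random

# Hypothetical data: six people in two tight cliques of three.
CLIQUE = [0, 0, 0, 1, 1, 1]
PROXIMITY = [[0 if i == j else (1 if CLIQUE[i] == CLIQUE[j] else -1)
              for j in range(6)] for i in range(6)]


def fitness(assignment, proximity):
    """Reward close pairs placed together and distant pairs placed apart."""
    total = 0.0
    n = len(assignment)
    for i in range(n):
        for j in range(i + 1, n):
            same_group = assignment[i] == assignment[j]
            total += proximity[i][j] if same_group else -proximity[i][j]
    return total


def evolve(proximity, groups=2, pop_size=30, generations=60, seed=0):
    """Return the fittest assignment found by a simple elitist genetic algorithm."""
    rng = random.Random(seed)
    n = len(proximity)
    population = [[rng.randrange(groups) for _ in range(n)]
                  for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=lambda a: fitness(a, proximity), reverse=True)
        survivors = population[: pop_size // 2]   # keep the fitter half
        children = []
        while len(survivors) + len(children) < pop_size:
            mother, father = rng.sample(survivors, 2)
            cut = rng.randrange(1, n)             # one-point crossover
            child = mother[:cut] + father[cut:]
            if rng.random() < 0.3:                # occasional mutation
                child[rng.randrange(n)] = rng.randrange(groups)
            children.append(child)
        population = survivors + children
    return max(population, key=lambda a: fitness(a, proximity))
```

For this toy matrix the best possible score is 15, reached when each clique is assigned to its own subgroup, while placing everyone in one subgroup scores -3. Calling `evolve(PROXIMITY)` returns the fittest assignment the search encountered, which need not be the optimum on every run.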


Data-Based Simulations 


Conceptually intermediate between theory-driven simulations and quantitative 
analysis of data are simulations that might be described as data-based. The 
notable example, with a 20-year history, is the work on affect control theory 
begun by Heise (1986, 1987, Smith-Lovin 1987). Based on the EPA model of 
affective meaning developed by Charles Osgood, who employed semantic 
differential techniques to identify three dimensions of word meaning (Evalu- 
ation, Potency, Activity), affect control theory asserts that social events are 
constructed so as to confirm the meanings of social classifications. Research 
subjects have provided mean EPA ratings of thousands of words describing 
social identities, attributes, behaviors, and settings in five nations. Heise and 
his co-workers have derived a number of mathematical functions to predict 
how people would rate various combinations of words, and embodied both the 
formulas and the data in computer programs. Their theoretical agenda partic- 
ularly stresses differences between social events that confirm or disconfirm 
sentiments attached to key nouns such as those describing standard social roles, 
but their general method could be applied quite widely. Simulations based on 
data do not qualify as ASI unless they also incorporate a dynamic model of 
human thought, but because Heise's programs meet this test they give powerful 
testimony to the great potential of ASI to bring theory and data together in 
important new ways. 


ASI APPLICATIONS TO RESEARCH 


AI-assisted empirical sociological research is still in its infancy, so it is difficult 
not only to predict the range of applications it will have in future, but also to 
identify the current work that deserves closest attention. However, considerable 
progress has been achieved in two areas that clearly have great potential and 
nicely bracket the diversity of techniques we have described: qualitative anal- 
ysis of verbal or written texts using expert systems or other varieties of 
symbolic processor, and enhancements to conventional statistical analysis such 
as substituting neural networks for multiple regression. 


Qualitative Research with Symbolic Processors 


In the 1960s, researchers found that very convincing interviewing programs 
could be created with surprisingly limited computing machinery and software. 
Especially controversial were programs that simulated a psychotherapist con- 
versing with a patient (Colby et al 1966, Weizenbaum 1976). More recent 
work has suggested that fully computerized interviewing may have distinct 
advantages for some kinds of research, for example, sensitive topics like sexual 
behavior where respondents might be embarrassed to answer questions posed 
by another human being (Binik et al 1989). 

Interview programs constructed along the lines of expert systems give so- 
ciologists an entirely fresh way of looking at data. Decades of quantitative 
research have been based on the concept of rectangular data matrices consisting 
of a large number of cases times a large number of variables, with the assump- 
tion that each case has a value (perhaps known or perhaps missing) for each 
variable. Relational data bases, such as those incorporated in many expert systems, 
are very different in structure, as we have noted. Their topology may be very 
complex but generally consists of a network of nodes and relationships, with 
no matrix of cases by variables existing in the computer's memory. Carley has 
shown that such systems can be used to discover an individual's structure of 
meanings and then to compare that structure with the cognitive maps of other 
individuals (1986). 

Possibly the greatest research potential for ASI in the coming decade is in 
computer-assisted analysis of written text. The federal government is increas- 
ing its already significant support for development of the National Information 
Infrastructure (Information Infrastructure Task Force 1993). Whatever the 
exact form it takes, the “NII” will involve a tremendous expansion of computer 
communication networks and on-line databases, with rapid growth of the 
libraries of text available in electronic form. This includes everything from 
newspapers, to congressional debates, to (eventually) the entire contents of the 
Library of Congress. The question then becomes: What software tools will 
sociologists need if they are to navigate effectively through this ocean of words 
and analyze selected portions of text in the most effective manner? 

Already a number of text-analysis software packages exist for microcom- 
puters. Heise (1992) has shown that much can be accomplished with an 
ordinary word processor, and software packages like HyperResearch and Ethno 
have some of the qualities of expert systems, and thus they begin to enter the 
territory of ASI. "Intelligent" search procedures and modern knowledge rep- 
resentation schemes can help pre-process data (Franzosi 1990a,b) and recode 
data (Carley 1988) for general content or map analysis procedures. For some 
of these procedures the "intelligence" is built into the coding mechanism, in 
the form of "frames" that the researcher must fill in (Roberts 1989, Carley & 
Palmquist 1992, Carley 1993). These frames, which embody vast quantities 
of expertise, are then used to postprocess and analyze the data. 

Within narrative analysis, AI procedures can be used for examining, pro- 
cessing, and generating the story line in the narratives (Abell 1984, 1989). 
Ethno, for example, allows one to model event structures, including narratives, 
as production systems (Heise 1989, Griffin 1993). Related approaches are 
decision-based procedural analysis or protocol analysis, where the goal is to 
locate the explicit and implicit "rules" that the speaker uses to perform a task 
such as playing chess (Ericsson & Simon 1984). Gilbert & Heath (1986) have 
shown how PROLOG can be the basis of an intelligent system to capture public 
rules from narratives and retrieve the sense of textual items, illustrating the 
ways this would be done with medical records. Cope is a software system of 
nodes and linkages designed to produce cognitive maps of texts, thus helping 
develop grounded theory and capture verbal accounts (Cropper et al 1990). 
Automatic procedures such as Cirrus (VanLehn & Garlick 1987, Kowalski & 
VanLehn 1988) and ACM (Langley & Ohlsson 1984) have emerged, providing 
hope that larger numbers of texts can be analyzed quickly and economically. 

Social scientists of politics have used expert system shells to analyze se- 
quences of deeds and words in international relations (Schrodt 1988). For 
example, Mills (1990) created a rule-based expert system for analyzing nego- 
tiations and applied it to three sessions of talks between China and the Soviet 
Union. As the program runs, it asks the social scientist a set of questions about 
the behavior of each side at different points in the episode, then it outputs a 
summary analysis. 

Despite the disillusionments of the 1960s, natural language processing has 
made substantial progress in translating texts and extracting meaning from 
them, particularly in the realm of story understanding (Abelson 1976, Rumelhart 
1978, Schank & Riesbeck 1981, Lehnert & Ringle 1982). Plot-based procedural 
analysis lends itself to automation due to the presence of basic syntactic 
units (Lehnert 1981, Lehnert & Vine 1987) that make possible the automatic 
coding of texts. 

Often the challenge is to trace linkages between texts, and a prime example 
is citation analysis in the sociology of science. Recent work in this area has 
employed search procedures from computer science to locate citation paths 
giving a history of which researcher cites which (Hummon & Doreian 1989, 
1990, Hummon et al 1990, Hummon & Carley 1993, Carley et al 1993). These 
procedures make it possible not only to identify the main path of scientific 
development, but to understand the roles played by different types of research. 
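
One simple way to trace citation paths is to count, for every citation link, how many complete paths from the earliest to the latest papers pass through it, and then to follow the most heavily traversed links. The Python sketch below does this for a small hypothetical network. The weighting used here is only one of several possible traversal counts and is an assumption, not necessarily the measure used in the studies cited; ties are broken alphabetically.

```python
from functools import lru_cache

# Hypothetical citation network: an edge u -> v means that the later
# paper v cites the earlier paper u.
EDGES = {
    "p1": ["p2", "p3"],
    "p2": ["p4"],
    "p3": ["p4", "p5"],
    "p4": ["p6"],
    "p5": ["p6"],
    "p6": [],
}


def edge_path_counts(edges):
    """Count the source-to-sink paths that traverse each citation link."""
    nodes = set(edges) | {v for targets in edges.values() for v in targets}
    preds = {node: [] for node in nodes}
    for u, targets in edges.items():
        for v in targets:
            preds[v].append(u)

    @lru_cache(maxsize=None)
    def paths_to_sink(node):
        targets = edges.get(node, [])
        return 1 if not targets else sum(paths_to_sink(v) for v in targets)

    @lru_cache(maxsize=None)
    def paths_from_source(node):
        if not preds[node]:
            return 1
        return sum(paths_from_source(u) for u in preds[node])

    return {(u, v): paths_from_source(u) * paths_to_sink(v)
            for u, targets in edges.items() for v in targets}


def main_path(edges, start):
    """Greedily follow the most traversed link from a starting paper."""
    counts = edge_path_counts(edges)
    path = [start]
    while edges.get(path[-1]):
        current = path[-1]
        # sorted() makes tie-breaking alphabetical and deterministic.
        path.append(max(sorted(edges[current]),
                        key=lambda v: counts[(current, v)]))
    return path
```

In this example the link from `p1` to `p3` lies on two of the three complete paths, so the greedy search leaves `p1` by way of `p3` rather than `p2`.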


AI-Enhanced Statistical Analysis 


Neural networks can readily substitute for multiple regression and for other 
multivariate techniques that aim to predict the value of one variable on the 
basis of the values of other variables. It is claimed that neural nets outperform 
other methods, chiefly because a sufficiently large neural net can in principle 
handle any pattern of nonlinearity in the relationships and complex interactions 
between independent variables. Indeed, for some problems neural nets may 
represent overkill, and if a net is given too many hidden units, it can overfit 
the data disastrously, producing an unreasonably complex and contorted curve 
on the scattergram. 

Another disadvantage is that neural nets solve problems in ways that are far 
from transparent to human users, and they do not automatically generate lucid 
equations that can be comprehended in terms of explanatory theories. Perhaps 
this is why neural nets have been employed analytically in the social sciences 
primarily to make economic predictions (Lin & Lin 1993) when predictive 
power may be more important than explanatory intelligibility. 

Neural nets do readily produce some conventional measures such as mean 
squared error (Smith 1993), and robust estimates of errors can easily be derived 
through procedures related to bootstrapping (Dietz et al 1987, 1991). The fact 
that neural nets have been doing useful statistical analysis for less than five 
years suggests that a full kit of related tools will require further time and effort. 
Perhaps one of the best ways to accomplish that is simply to attempt a variety 
of empirical studies and see what capabilities need to be added to those already 
possessed by neural nets. 

Kimber (1991) compared neural networks with traditional methods and with 
ID3, a classification algorithm sometimes built into expert systems, to see 
which technique could best predict the emergence of democracy in nations, 
on the basis of such variables as urbanization, literacy, and economic resource 
distribution. Notably, the neural network performed better than did traditional 
regression analysis. Similarly, neural nets did well in predicting outcomes of 
conflict between nations, in a study by Schrodt (1991). Huntley (1991) applied 
neural networks to analysis of time series data in order to forecast manpower 
needs in the Navy. Garson (1991) tested neural nets, ID3, and some more 
traditional techniques on sets of simulated data where the actual relationships 
between variables could be specified, finding that neural nets did a superior 
job with several kinds of problem. 

Clearly, neural nets are not the only AI technique that may be useful for 
statistical analysis, and some of the procedures employed in symbolic proces- 
sors could be applied to quantitative rather than qualitative data. While genetic 
algorithms have scored successes in econometric modelling (Koza 1992), their 
potential for analyzing social-scientific data remains largely unexplored. 

A range of AI techniques can also be used in automatic security systems 
that prevent misuse of confidential data while enabling maximum legitimate 
use by social scientists (Keller-McNulty & Unger 1993). Government agencies 
and private corporations frequently corrupt datasets before releasing them, to 
satisfy anonymity and confidentiality regulations. Their chief concern is to 
prevent users from identifying the record of a particular person, and they do 
this by deleting cells from a table, truncating or collapsing values, adding 
random numbers to cells or values, and removing variables. The alternative is 
to embed the dataset in full-featured statistical software that employs encryp- 
tion and analysis-monitoring techniques to prevent the user from inspecting 
the raw data directly and from deducing the identities of particular cases. An 
AI system could watch the user and prevent any sequence of statistical 
manipulations that could identify a single case, and it could also link separate 
datasets about the same people through their names without letting the user 
see them. 
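
The simplest ingredient of such a guarded dataset is a rule that refuses any statistic computed on too few cases. The sketch below shows only that ingredient. The class name, the threshold of five cases, and the record layout are hypothetical, and a realistic system would also have to track sequences of queries, since two permitted queries can sometimes be subtracted to isolate one person.

```python
MIN_CELL_SIZE = 5  # hypothetical disclosure threshold


class GuardedDataset:
    """Answer aggregate queries without exposing the raw records."""

    def __init__(self, records, min_cell_size=MIN_CELL_SIZE):
        self._records = list(records)
        self._min_cell_size = min_cell_size

    def mean(self, variable, **conditions):
        """Return the mean of a variable over cases matching the conditions."""
        rows = [record for record in self._records
                if all(record.get(key) == value
                       for key, value in conditions.items())]
        # Refuse any query that would describe too few people.
        if len(rows) < self._min_cell_size:
            raise PermissionError("query refused: too few matching cases")
        return sum(record[variable] for record in rows) / len(rows)
```

A call such as `dataset.mean("income", region="north")` succeeds when at least five cases match and raises `PermissionError` otherwise, so the user obtains the statistic without ever inspecting an individual record.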

Expert systems and related tools such as hypertext can serve as method- 
ological consultants, whether attached to familiar statistical packages or pro- 
duced as stand-alone software. Statistical Navigator (Brent et al 1991) is a 
decision support system that helps a researcher select which techniques of 
statistical analysis are appropriate for an intended research project, and it can 
be used to give students an overview of statistical methods commonly used 
by social scientists. In its "consult mode," the software asks the researcher a 
series of questions about the aims and assumptions of the research project, and 
about the intended audience for its publications, using the answers to early 
questions to decide which other questions to ask. Many require the user to rate 
various possible objectives of the work on a 0 to 10 scale. After this, the 
software recommends a short list of methods to the user, rating them in terms 
of how well they serve the goals, assumptions, and audience for the particular 
piece of research. The hypertext feature allows the user to see a description 
of each method, linked to definitions of all technical terms, and the software 
can also be run in "browse mode" where the user can roam this network of 
information at will. A detailed report of how the system arrived at its conclu- 
sions can be printed directly or saved onto disk for later editing. 
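
The rating step of a consultant of this kind can be pictured as a weighted score. In the sketch below, the user's 0 to 10 ratings of each objective are multiplied by a fit value for each method and summed. The table of fit values, the objective names, and the function `recommend` are invented for illustration and do not describe how Statistical Navigator actually computes its recommendations.

```python
# Hypothetical fit of each method to each research objective (0 to 1).
METHOD_FIT = {
    "multiple regression": {"predict": 0.9, "classify": 0.3, "explore": 0.4},
    "discriminant analysis": {"predict": 0.4, "classify": 0.9, "explore": 0.3},
    "factor analysis": {"predict": 0.2, "classify": 0.3, "explore": 0.9},
}


def recommend(ratings, top=2):
    """Rank methods by the user's 0-10 objective ratings weighted by fit."""
    scores = {
        method: sum(ratings.get(objective, 0) * fit
                    for objective, fit in fits.items())
        for method, fits in METHOD_FIT.items()
    }
    return sorted(scores, key=scores.get, reverse=True)[:top]
```

With ratings of 10 for prediction, 2 for classification, and 5 for exploration, the sketch places multiple regression first and discriminant analysis second.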


CONCLUSION 


Work on artificial intelligence has been subject to innumerable fads and fre- 
quently overblown publicity. Early proponents often made highly exaggerated 
claims for the computer programs they had written and hollow promises about 
what they would shortly accomplish. But these facts should not deter sociol- 
ogists from examining the potential of artificial social intelligence, because 
steady progress in computer science has brought technology to the point where 
several valuable applications already exist and even modest extrapolations 
suggest that ASI could be of great significance to sociology. 

Sociologists interested in exploring ASI will ask themselves how much they 
need to learn about computing, as opposed to relying upon computer scientist 
collaborators for expertise in that field. We think it is essential to be able to 
program in at least one high-level computing language, preferably in two very 
different ones such as Pascal and PROLOG, and to inform oneself about a 
range of recent technical developments. Particularly in the field of artificial 
intelligence, it is difficult to understand the meaning of the techniques unless 
one is capable of programming at least some of them from scratch. It is true 
that ready-made AI software exists that can be used for ASI, notably expert 
system shells and neural network packages for statistical analysis, but for the 
foreseeable future most ASI applications in sociology will require writing a 
considerable amount of fresh code. And the fact is that sociologists and computer 
scientists have great difficulty communicating with each other, neither one 
generally appreciating the other's assumptions or understanding the simplest 
things the other says. We are convinced that collaborations between sociolo- 
gists and computer scientists will be highly fruitful, but the sociologist will 
have to go the extra mile and enter the world of computers if such projects 
are to have any chance of success. 

Current graduate training does not prepare students to take advantage of 
ASI. Although some probability theory can be useful, hardly any of the material 
taught in statistics courses is relevant to the computer techniques described 
here. Two or three decades ago, there was much debate about requiring students 
to learn computer programming, even the interesting idea of counting high- 
level computing languages toward the then-existing foreign language require- 
ments. Of course, today few sociologists write programs from scratch, because 
they cannot compete with the elaborate and accurate statistical packages avail- 
able on the market at low cost. But now, we suggest, ASI presents an array 
of new reasons to become competent in programming, not the least of which 
is our belief that ASI skills enhance a person's capacity to think both logically 
and creatively. 

While recognizing the danger of being swept away by excessive enthusiasm, 
in a field that has been rife with fads, we believe that ASI opens an entirely 
fresh era for social theory. Sociologists are concerned with the behavior of 
societies of complex, adaptive individuals. Therefore, rigorous theory building 
and testing requires the use of mathematics or computer simulations. Even as 
physics and economics are developing computational components, so should 
sociology. 

The general public, to the extent that it has any opinion about social theory 
at all, probably considers it to be mere ideology. So long as theories are 
rambling verbal meditations punctuated with dubious metaphors, there is little 
defense against this accusation. ASI and mathematical formalism are compat- 
ible methods for stating a theory precisely, connecting its concepts in rigorous 
intellectual structures, and identifying both hidden assumptions and unex- 
pected consequences. Skillfully written simulation programs can be an excel- 
lent medium for communication of precise theoretical statements, so long as 
the intended audience has learned how to read programs. 

Whatever the future of various government initiatives in high performance 
computing, a Global Information Infrastructure is gradually emerging. Internet 
links the National Science Foundation with literally millions of computer users 
already, and many archives and libraries are steadily increasing the amount of 
text and other data available over it. Alternatively, the medium for distribution 
of data may be laser discs (CD ROM) or a successor technology that carries 
information in physical objects rather than in electronic bit streams. Social 
scientists need to be involved in the development of these technologies, in part 
because we can be sophisticated advocates for the general public whose real 
data and software needs might otherwise have little influence on systems 
created by engineers and bureaucrats. As the Global Information Infrastructure 
develops, both social scientists and members of the general public will need 
ASI agents, specialized computer programs that can be sent into this practically 
infinite universe of data in search of desired information. 

Sociologists have become adept with a wide variety of statistical tools, and 
one would have thought that our quantitative methodology is thoroughly ma- 
ture at this point. Thus it is surprising to see neural networks suddenly com- 
peting with multiple regression and other well-established methods. Partic- 
ularly for analysis of texts, ASI techniques may prove superior to other ap- 
proaches, and it is possible that artificial intelligence will play a prominent 
role in management and analysis of quantitative datasets, as well. 

There is much talk these days about the malaise into which sociology 
supposedly has fallen. Whether or not this is true, ASI has the potential to 
revolutionize sociology. To be sure, if unprepared sociologists rushed to ASI 
in search of salvation, they would undoubtedly be disappointed, and the 
substantial real gains that ASI promises might be lost in their disillusionment. 
Prudent but creative incorporation of ASI methods into sociology could rein- 
vigorate stagnant subdisciplines, open new fields for exploration, keep sociol- 
ogists competitive in the study of traditional sociological questions, and pre- 
vent our discipline from falling behind other social and behavioral sciences 
that have more enthusiastically exploited the tremendous possibilities offered 
by computer intelligence. 
Abstract 


This review explores historical changes in generational relations in American 
Society as they affect adaptation to the later years of life. Following a life 
course perspective, the review examines changes in the timing of life transi- 
tions, in family relations, and in generational and kin assistance and their 
impact on support in old age. In doing so, it demonstrates the significance of 
a historical and life course approach to the understanding of generational 
relations over time. Dispelling prevailing myths about coresidence and gener- 
ational assistance in the past, the review discusses the circumstances under 
which nuclear household arrangements were modified and explores patterns 
of assistance inside and outside the household. It links demographic changes 
in the timing of life course transitions with patterns of supports to aging parents 
in the context of changing reciprocities among kin. By comparing two cohorts 
of adult children in an American community in terms of their supports to aging 
parents, as well as their attitudes toward generational assistance, the review 
identifies historical changes in the relations between generations in the larger 
context of family relations and kin assistance. 


INTRODUCTION 


The study of generational relations in the later years of life in American society 
has been oriented to the present and clouded by myths about the past. An 
understanding of this subject in contemporary society depends on a knowledge 
of the larger processes of social change that have affected the timing of life 
course transitions, family patterns, and reciprocal relations among kin 
(Bengtson et al 1985, Shanas 1986, Bengtson & Treas 1985, Shanas & Sussman 
1981). As this chapter suggests, both a historical perspective and a life course 
paradigm are important for the exploration of these processes. 

A historical perspective, in addition to providing a context of change over 
time, illuminates the ways in which historical events and circumstances have 
affected the life experience of different age groups (Hareven 1977a, Elder 
1978). At the same time, the life course paradigm helps focus attention on the 
interaction of demographic, social-structural, and cultural factors in shaping 
family patterns and generational relations. It illuminates the impact of historical 
events on the lives of various cohorts, and their consequences for old age 
(Elder 1974, Elder 1993, Riley et al 1978, Hareven 1991a). 

A life course perspective thus provides a necessary dimension and an inte- 
grating framework for the study of intergenerational relations because it is 
both developmental and historical. It provides an understanding of the ways 
in which earlier life experiences of older adults, as shaped by historical events 
and by their respective cultural heritage, have affected their values governing 
family relations, their expectations of kin support, and the nature of their 
interaction with welfare agencies and institutions. 

Generational relations in old age can be best understood in the context of 
the entire life course and of the historical changes affecting people at various 
points over their lives. They are molded by individual and familial experiences 
and by the specific historical circumstances that have impinged on people's 
lives. Rather than viewing older people as a homogeneous group, a life course 
perspective considers them as age cohorts moving through historical time, each 
cohort with its distinct life experiences shaped by the circumstances encoun- 
tered earlier in life (Hareven 1978b, Elder 1978). 

The adaptation of individuals and their families to the social and economic 
conditions they face in the later years of life is contingent on the pathways by 
which they reach old age. Relations of mutual support are formed over life 
and are revised in response to historical circumstances such as migration, wars, 
depressions, and the decline or collapse of local economies that people may 
have encountered at various points in their lives. Hence, patterns of support 
and expectations for receiving and providing assistance in old age are part of 
a continuing process of interaction among parents, children, other kin, and 
unrelated individuals. They are shaped by cultural traditions and the strategies 
that people follow over their lives, as they move through historical time 
(Hareven 1981, Elder 1982, Hogan et al 1993). 

Following a historical and life course perspective, this chapter examines 
changes in demographic behavior, in family and household organization, in 
the timing of life course transitions and in kin assistance in American society, 
as they have affected generational supports in the later years of life. 


THE LIFE COURSE PARADIGM 


The life course approach is developmental and historical by its very nature. 
Its essence is the synchronization of "individual time," "family time," and 
"historical time." Underlying the life course paradigm are three major dimen- 
sions: (i) the timing of life transitions in the context of historical change, (ii) 
the synchronization of individual life transitions with collective familial ones 
and their impact on generational relations, and (iii) the impact of earlier life 
events, as shaped by historical circumstances previously encountered, on sub- 
sequent events. 

The timing of life transitions involves the balancing of individuals’ entry 
into and exit from different roles—education, family, work, and community— 
over their life course. It addresses the key question: How did individuals time 
and sequence these transitions in changing historical contexts? In all these 
areas, the pace and definition of “timing” hinge upon the social and cultural 
contexts in which transitions occurred, and the cultural construction of the life 
course in different time periods and in different societies (Neugarten & Datan 
1973, Clark & Anderson 1976). 

The second dimension of the life course approach involves the synchroni- 
zation of individual life transitions with collective family transitions, such as 
leaving home, getting married, and entering the labor force. Individuals engage 
in a variety of familial configurations that change over the life course and that 
vary in different historical contexts. Although age is an important determinant 
of the timing of transitions, it is not the only significant variable. Changes in 
family status and in accompanying roles are often as important as age, if not 
more so (Hareven 1991a, Hareven & Masaoka 1988, Riley & Riley 1993). 

The synchronization of individual transitions with familial ones is a crucial 
aspect of the life course and impinges directly on generational relations, es- 
pecially when individual goals are in conflict with the needs and dictates of 
the family as a collective unit. In the nineteenth century, for example, the 
timing of young adults’ life transitions often clashed with the demands and 
needs of aging parents. Parents discouraged the youngest daughter from leav- 
ing home and marrying so that she could continue to support them at home in 
their old age. Daughters succumbed to these dictates, despite their preference 
to leave home and start a life of their own (Hareven 1982, Hareven & Adams 
1994). Similarly, the timing of later life transitions affected more than one 
generation. For example, the death of “old, old” parents enabled caretaking 
children who were themselves old to begin providing for their own old age, 
as well as for their adult children or grandchildren (Hogan et al 1994). 
Historical forces thus play a crucial role in this complex, cumulative pattern 
of individual and familial life trajectories. They have a direct impact on the 
life course of individuals and families at the time when they encounter them, 
and continue to have an indirect impact over the entire life course. This means 
that the social experiences of each cohort are shaped not only by the historical 
events and conditions its members encounter at a certain point in life, but also 
by the cumulative impact of the historical processes that influenced their earlier 
life transitions (Hareven & Masaoka 1988). 

The impact of historical events on the life course may continue over several 
generations. One generation transmits to the next the ripple effects of the 
historical circumstances that affected its life history. Elder & Hareven (1992) 
found that in the same age cohorts in two different communities, delays or 
irregularities in the parents' timing of their work and family transitions as a 
result of the Great Depression affected the subsequent timing of the children's 
life transitions. The children thus experienced the impact of historical events 
on two levels: directly, through their encounter with these events in their early 
adulthood, and indirectly, in the transmission of these events across the gen- 
erations. A life course perspective provides a framework for understanding 
variability in the patterns of support in the later years of life, as well as 
differences in the expectations of the recipients and the caregivers who are 
influenced by their respective social and cultural milieux (Fry & Keith 1982, 
Kohli 1986, Kiefer 1974, Jackson et al 1990). 

Generational assistance is shaped by values and experiences that evolve or 
are modified over the entire life course. In the United States, ethnic values 
rooted in various premigration cultures call for a more exclusive dependence 
on filial and kin assistance than the more contemporary attitudes, which ad- 
vocate reliance on supports available from government programs and commu- 
nity agencies. Such differences in values are expressed in the caregiving 
practices and attitudes of successive cohorts. The earlier life course experiences 
of each cohort, as shaped by historical events, also have an impact on the 
availability of resources for their members and on their modes of assistance 
and coping abilities in the later years of life (Elder 1974, Hareven & Adams 
1994, Elder et al 1992, Sokolovsky 1990). 

Specifically, recent research on the life course has made an important con- 
tribution to the understanding of generational relations and social change by 
refining the distinction between "generation" and "cohort," two concepts that 
have frequently overlapped in the gerontological literature (Bengtson et al 
1985, Hill 1970). “Generation” designates a kin relationship and a genealogical 
lineage (for example, parents and children or grandparents and grandchildren) 
and may encompass an age span as wide as 30 years or more. A “cohort” 
consists of a more specific age group that has shared a common historical 
experience. Most importantly, a cohort is defined by its interaction with the 
historical events that affect the subsequent life course development of that 
group (Ryder 1965, Riley 1978, Riley et al 1972). Over the past decade and 
a half, several scholars have warned against “confounding the genealogical 
and cohort meanings of ‘generation.’” Building on the major problems that 
Elder (1978) and Vinovskis (1988) identified, Kertzer has pointed out that 
when a population is divided on genealogical principles into various genera- 
tions, there is substantial overlapping in age among the generations. It would 
thus be "impossible to properly characterize the generations in terms of their 
common characteristics vis a vis other generations" (Kertzer 1983:130). 

A generation might consist of several cohorts, each of whom has encoun- 
tered different historical experiences that have affected its life course. For 
example, in Hareven & Adams's (1994) comparison of patterns of assistance 
of two cohorts of adult children to aging parents in a New England community, 
members of the same generation belonged to different cohorts: In families with 
large numbers of children, siblings in the same family were members of two 
cohorts, who differed in the historical experiences they had encountered and 
in their attitudes toward generational supports. If one intends to examine 
change over time in generational relations, then it is necessary to compare 
cohorts, not generations. 
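
The distinction can be made concrete with a small sketch. It is illustrative only: the birth years, the ten-year cohort width, and the names Person and cohort_of are assumptions introduced here, not details taken from the studies cited above. It shows two siblings who share a genealogical generation yet fall into different birth cohorts.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Person:
    name: str
    birth_year: int
    generation: int  # genealogical rank in the family: 1 = parents, 2 = their children


def cohort_of(birth_year: int, width: int = 10) -> str:
    """Label the birth cohort as a span of `width` years (an assumed convention)."""
    start = birth_year - birth_year % width
    return f"{start}-{start + width - 1}"


# Hypothetical siblings from one large family, born fourteen years apart.
siblings = [
    Person("eldest child", 1912, 2),
    Person("youngest child", 1926, 2),
]

generations = {p.generation for p in siblings}          # one genealogical generation
cohorts = {cohort_of(p.birth_year) for p in siblings}   # two distinct birth cohorts

for p in siblings:
    print(p.name, "generation", p.generation, "cohort", cohort_of(p.birth_year))
```

Grouping these siblings by generation would pool people whose historical experiences differed, which is why a comparison over time is better keyed on the cohort label.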


MYTHS ABOUT THE PAST 


Historical research has dispelled the myths about the existence of an ideal 
three-generational family in the American past, according to which elderly 
parents coresided with their adult children and were supported by the younger 
generations after they reached dependent old age. This research has also 
challenged the prevailing sociological theory that industrialization destroyed 
the great extended households of the past and led to the emergence of a nuclear 
family system and to the isolation of the elderly. In reality, in the American 
colonies and in pre-industrial Europe, coresidence of three generations in the 
same household was not the modal familial arrangement. Given the high 
mortality rate, most grandparents could not have expected to overlap their 
grandchildren's lives for a significant time period (Greven 1970, Demos 1970, 
Laslett & Wall 1971). The great extended families that have become part of 
the folklore of modern society rarely existed in the past (Laslett 1977, Goode 
1963, Hareven 1971). 

As in the present, early American households were nuclear in their structure. 
The older generation resided in households separate from those of their married 
adult children but were located nearby, often on the same land. Opportunities 
for contact and cooperation among the generations abounded in what was 
characterized as a "modified extended family system" (Greven 1970). These 
voluntary, reciprocal relations were different, however, from an 
institutionalized stem family system, which characterized the coresidence of generations 
in Central Europe (Plakans 1987). 

Nor was there a “golden age” in the family relations of older people in the 
American or European past. Even in the colonial period, elderly people were 
insecure in their familial supports, though they were revered and accorded 
higher social status than they are today (Fischer 1977). Aging parents had to 
enter into contracts with their inheriting sons in order to secure supports in old 
age in exchange for land. The emphasis in such contractual arrangements on 
specific details suggests the potential tensions and insecurities that parents 
anticipated concerning their care after they became too frail to support them- 
selves (Demos 1978, Smith 1973). 

Similarly, in urban-industrial society in the nineteenth and twentieth centu- 
ries, older people were not guaranteed supports from their children. In Amer- 
ican society, familial support and care for older people, as well as more general 
patterns of kin assistance, historically have been voluntary and based on 
reciprocal relations over the life course (Adams 1968, Sussman 1959). In the 
absence of social security and institutions of social welfare, in the past norms 
dictated that kin engage in intensive reciprocal assistance. Adult children were 
expected to be the main caregivers for their aging parents (Demos 1970, 
Hareven 1982). Still, these patterns of care were voluntary rather than estab- 
lished by law. They depended, therefore, on carefully calculated strategies and 
on negotiated arrangements over the life course (Hareven 1982). 


CORESIDENCE 


The fact that aging parents and adult children rarely coresided in multigener- 
ational households does not mean that the generations lived in isolation from 
each other. Even in urban society, throughout the nineteenth century, solitary 
residence—a practice that has become increasingly prevalent among older 
people today—was most uncommon for all age groups (Kobrin 1976). Autonomy 
in old age, however, partly expressed in the opportunity for older people 
to head their own households, hinged on some form of support from an adult 
child living at home or on the presence of unrelated individuals in the house- 
hold (Hareven 1981). The ideal was the generations' residence on the same 
land in rural areas, or in the same building or same neighborhood in urban 
areas. “Intimacy from a distance,” the preferred mode of generational interac- 
tion in contemporary American society, has been persistent since the early 
settlement and reaches back into the European past (Demos 1970, Laslett & 
Wall 1971). 

Despite this overall commitment to residence in nuclear households, com- 
mon to members of various ethnic groups and native-born Americans alike, 
nuclear households expanded to include other kin in times of need, during 
periods of accelerated migration or housing shortages. The most notable 
extension of the household occurred when elderly parents and especially 
widowed mothers were unable to maintain themselves in separate residences. In 
such cases, aging parents had an adult child return to live with them, or they 
moved into a child's household (Smith 1979, 1981, Ruggles 1987, Hareven 
1991b). Since household space was an important economic resource to be 
shared and exchanged over the life course, the configurations of household 
members changed over the life course in relation to the family's economic 
needs or in response to external opportunities (Hareven 1990). Older people 
whose children had left home shared household space with boarders and 
lodgers in exchange for services or rent, or with their own children who 
returned with their spouses to the parental home because of housing shortages 
or their aged parents' frailty (Chudacoff & Hareven 1979). 

In the later years of life, boarding and lodging served the function of the 
"social equalization of the family," a strategy by which young men or women 
who left their parents' home communities moved into the households of people 
whose own children had left home. Trading household space with boarders 
and lodgers thus made it easier for families to adhere to their traditional values 
without slipping below the margin of poverty. Elderly couples who had no 
children or whose children had moved far away took in boarders and lodgers 
in exchange for money or assistance (Modell & Hareven 1973). 

Boarding provided an important means of mutual exchanges between the 
generations even if they were not related. About one third of the men and 
women in their twenties and thirties living in late nineteenth-century American 
urban communities boarded with other families. For young men and women 
in the transition between leaving their parents' homes and establishing their 
own households, boarding with older people offered affordable housing in 
surrogate familial settings. For older people, particularly for widows, it pro- 
vided the extra income needed to maintain their own residence and helped 
avert loneliness after their own children had left home. In some cases the 
function was reversed, and older people who could not live alone, but who 
had no children or relatives, moved as boarders into other people's households. 

Despite preferences for unrelated individuals, households expanded to in- 
clude kin, though usually for limited periods during times of need or at specific 
points in the life course. The unwritten rules about separate residence of the 
generations in American society were modified when aging parents became 
chronically ill or demented and, therefore, unable to live independently. Under 
such circumstances, frail elderly parents usually coresided with a child, or with 
other kin if no children were available (Hareven & Adams 1994). Only about 
12% to 18% of all urban households in the late nineteenth and early twentieth 
centuries contained relatives other than members of the nuclear family 
(Hareven 1977a). In urban communities, which attracted large numbers of migrants 
from the countryside or immigrants from abroad, coresidence with extended 
kin increased greatly over the nineteenth century (Hareven 1971, Hareven 
1982). The proportion of households taking in kin increased to 25% over the 
early twentieth century, but declined to 7% by 1950 (Ruggles 1987). As noted 
above, the presence of relatives other than members of the nuclear family in 
the household, though, was most common in the later years of life. 

Aging parents and widowed mothers strove to maintain their autonomy by 
retaining the headship of their own households, rather than moving in with 
their children, relatives, or indeed strangers. The powerful commitment to the 
continued autonomy of the household was clearly in conflict with the needs 
of people as they were aging. In the absence of adequate public and institutional 
supports, older people striving to maintain independent households were 
caught in the double bind of living separately from their children, yet having 
to rely on their children's assistance in order to do so (Chudacoff & Hareven 
1979). Holding on to the space and headship of their household in exchange 
for future assistance in old age was an important survival strategy for older 
people in urban society, one reminiscent of the contracts between inheriting 
sons and rural older people in preindustrial Europe and colonial New England, 
as discussed above. To continue living separately in their own household, 
parents ensured that at least one adult child remain at home. In the absence of 
a child, frail elderly people, especially widows, had to move in with relatives 
or strangers (Chudacoff & Hareven 1978, Hareven & Uhlenberg 1994, Smith 
1979, Hareven 1981). 

An examination of patterns of generational coresidence raises several ques- 
tions related to household headship and to the nature of the supports inside 
the household: When a household record in a census listed a parent as being 
the head of the household and an adult child as residing in the household, who 
in reality headed the household, and how did the resources and assistance flow? 
It is difficult to answer these questions from cross-sectional data, nor can this 
type of data provide an explanation as to what the dynamics were in these 
household arrangements. Did the son succeed to household headship after his 
father retired or became too old or frail to support himself, or did the parents 
move into the son’s household? Under what circumstances did the older 
generation coreside with adult children or other kin; and under what circum- 
stances did they reside separately and engage in various types of assistance 
outside the household? It would be impossible to answer these questions from 
cross-sectional data. They need to be addressed, however, when analyzing 
longitudinal and retrospective data. 

Rates of coresidence recorded in cross-sectional data might reflect a life 
course pattern in which elderly parents who did not coreside with their children 
at the time a census or survey was taken might do so later when they became 
more dependent. Cross-sectional data can obscure considerable variation in 
patterns of coresidence over the life course: In the National Survey of Families 
and Households, for example, only 7% of Americans aged 55 and older with 
a surviving parent had the parent living with them at the time the survey was 
taken. The retrospective questions revealed, however, that one quarter of the 
persons surveyed in their late fifties had had an aging parent living with them 
at some point in their lives. Hogan, Eggebeen, and Snaith (1994) explain this 
discrepancy by the short time period that a sick parent actually resides with a 
child. The parent stays usually less than a year, because after the parent's 
health deteriorates, he or she either enters a nursing home or dies. 
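
The arithmetic behind this discrepancy can be sketched briefly. The sketch rests on stated assumptions that do not come from the survey: each stay is treated as equally likely to begin at any point in a fixed window of years, stays near the edges of the window are ignored, and the function name and the ten-year window are hypothetical. Under those assumptions, the share found coresiding on a given survey date is roughly the share who ever coreside multiplied by the fraction of the window that a stay occupies.

```python
def share_coresiding_at_survey(ever_share: float,
                               mean_stay_years: float,
                               window_years: float) -> float:
    """Approximate share of respondents with a parent in the household on the
    survey date, given the share who ever have one during the window.

    Assumes each stay is equally likely to fall anywhere in the window
    (a simplifying assumption, not a finding of the cited study).
    """
    if not 0.0 <= ever_share <= 1.0:
        raise ValueError("ever_share must lie between 0 and 1")
    if window_years <= 0 or not 0.0 <= mean_stay_years <= window_years:
        raise ValueError("stay length must lie between 0 and the window length")
    return ever_share * mean_stay_years / window_years


# Illustrative inputs: one quarter ever coreside, stays last about one year,
# and the stays are spread over an assumed ten-year window.
example = share_coresiding_at_survey(0.25, 1.0, 10.0)
print(f"{example:.1%} would be observed coresiding on the survey date")
```

The shorter the typical stay relative to the window, the further a single cross-section understates how common coresidence is over the life course.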


HISTORICAL CHANGES IN THE TIMING OF LIFE 
TRANSITIONS 


Demographic changes in American society since the late nineteenth century 
have significantly affected age configurations within the family and the timing 
of life course transitions, and they have had a major impact on the overlap of 
generations into old age (Uhlenberg 1974, Hareven 1976, Riley 1984). As 
Uhlenberg has pointed out, the decline in mortality since the late nineteenth 
century has resulted in greater uniformity in the life course of American 
families and has dramatically increased the opportunities for intact survival of 
the family unit over the lifetime of its members. An increasing portion of the 
population has lived out its life in family units. Except when disrupted by 
divorce, married couples have been able to live together for longer time 
periods, children have been growing up with their parents as well as with their 
siblings alive, grandparents have overlapped with their grandchildren into the 
latter's adulthood, and great grandparenthood has emerged as a new stage of 
adult life (Uhlenberg 1974, 1978). Longevity and the increasing survival to 
adulthood have maximized the opportunity for generational overlap and the 
incidence of available pools of kin, despite the decline in fertility (Townsend 
1968, Cherlin & Furstenberg 1990). 

Under the impact of demographic, economic, and cultural change, the timing 
of the major transitions to adulthood—particularly leaving home, entry into 
and exit from the labor force, marriage, parenthood, the "empty nest," and 
widowhood—has undergone significant changes over the past century. Under- 
lying these changes has been an increase in age uniformity in the timing of 
life transitions. Over the twentieth century, transitions to adulthood have be- 
come more uniform, more orderly in sequence, and more rapidly timed. This 
timing has become more regulated according to specific age norms, rather than 
in relation to the needs of the family. Individual life transitions have become 
less closely synchronized with collective familial ones, thus causing a further 
separation between the generations (Modell et al 1976). 

By contrast, in the nineteenth century the transitions to adulthood were more 
gradual and less rigidly timed. The time range necessary for a cohort to 
accomplish these transitions (leaving school, starting work, getting married 
and establishing a separate household) was wider, and the sequence in which 
transitions followed one another was not rigidly established. The nineteenth- 
century pattern of transitions allowed for a wider age spread within the family 
and for greater opportunity for interaction among parents and adult children. 
Later age at marriage, higher fertility, and shorter life expectancy rendered 
family configurations different from those in contemporary society (Uhlenberg 
1978). The increasing rapidity in the timing of the transitions to adulthood, 
the separation of an individual's family of origin from their family of procre- 
ation, and the introduction of publicly regulated transitions such as mandatory 
retirement, have converged to isolate and segregate age groups and generations 
in the larger society (Hogan 1989, Modell et al 1976, Hogan et al 1993). 

Because early and later life transitions are interrelated, these changes have 
affected the status of older people in the family and their sources of support, 
generating new kinds of stresses on familial needs and obligations. In the 
nineteenth century the timing of later life transitions to the empty nest, to 
widowhood, and out of the headship of one's own household, followed no 
ordered sequence and extended over a relatively longer time period. Older 
women did experience more marked transitions than men because losing a 
spouse in old age was more characteristic of the experience of women than of 
men; however, the continuing presence of at least one adult child in the 
household meant that widowhood did not necessarily represent a dramatic 
transition into the empty nest (Chudacoff & Hareven 1979, Smith 1979). 

The most pronounced discontinuity in the adult life course during the 
twentieth century, especially since World War II, has been the empty nest stage. 
A modal pattern of the middle and later years of life, the empty nest emerged 
as a result of the decline in mortality, and the combination of earlier marriage 
and the bearing of fewer children overall, with closer spacing of children and 
the more uniform pattern of children leaving home earlier in their parents’ 
lives. This meant that a couple experienced a more extended period of life 
without children, beginning in their middle years. A separation between the 
generations thus occurred when parents were still in middle age (Glick 1977). 
By contrast, in the nineteenth century, the residence of children in the 
parental household extended over a longer time period, sometimes over the 
parents’ entire life. Most importantly, the nest was rarely empty, because 
usually one adult child was expected to remain at home while the parents were 
aging (Smith 1981). Demographic factors account only in part for the empty 
nest. Children did not remain in their aging parents’ household simply because 
they were too young to move out. Even when sons and daughters were in their 
late teens and early twenties, at least one child stayed in the parental home to 
care for aging parents if no other assistance was available (Chudacoff & 
Hareven 1979, Hareven 1982). Leaving home did not so uniformly precede
marriage, and the launching of children did not necessarily leave the nest 
empty. As mentioned above, occasionally a married child returned to the 
parental home, or the parents took in boarders or lodgers. 

The timing of life transitions in the nineteenth century was erratic because 
it followed family needs and obligations rather than specific age norms. Fa- 
milial obligations, dictated by economic insecurity and by cultural norms of 
kin assistance, took precedence over strict age norms (Modell et al 1976). Over 
the twentieth century, on the other hand, age norms have emerged as more 
important determinants of timing than familial obligations. As Modell et al
(1976:30) concluded: “‘Timely’ action to 19th century families consisted of helpful
response in times of trouble; in the 20th century, timeliness connotes adherence 
to a socially-sanctioned schedule." As greater differentiation in stages of life 
began to develop following the turn of the century, and as social and economic 
functions became more closely related to age, a segregation between age 
groups emerged, and with it, an increasing separation among the generations. 
This separation occurred first in the middle class and was only later extended 
to the working class. The pattern still varies considerably among ethnic groups 
and among black families in contemporary society (Markides et al 1981, 
Markides & Krause 1985, Jackson, Jayakody & Antonucci 1994, Taylor et al 
1993). 

Age uniformity in the family and age segregation among generations, how- 
ever, may be modified as a result of remarriage following divorce. As Fursten- 
berg pointed out, remarriage recreates a kinship configuration that resembles 
those resulting through remarriage following the premature death of a partner 
in the eighteenth or nineteenth century. In contemporary society, remarriage 
following divorce may have a similar effect of incorporating new spouses and 
step siblings into a “blended family," who differ considerably in their ages. 
As age again becomes heterogeneous within the family, Furstenberg concludes, 
generational boundaries become less distinct in the everyday life of the family, 
and consequently, "the salience of generational boundaries in the larger society 
may decline as well" (Furstenberg 1981:136). 

Since the 1980s, more erratic and flexible patterns in the timing of life course 
transitions have emerged again. This pattern departs from the earlier age-re- 
lated rigidities in timing to reflect changes in family arrangements and new 
policies governing the work life. The movement of young adult children in 
and out of the parental home has become more erratic. Young adults stay on, 
or return to the parental home after having left previously (Goldscheider & 
DaVanzo 1985, Goldscheider & Goldscheider 1986). This contemporary pattern,
however, differs from that of the past in a fundamental way: In the late
nineteenth century children continued to stay in the parental home or moved 
back and forth in order to meet the needs of their family of orientation by 
taking care of aging parents or, in some cases, of younger siblings. In contemporary
society, young adult children reside with their parents in order to meet
their own needs, because of their inability to develop an independent work 
career or to find affordable housing. Another contemporary variant of the 
filling of the nest is the return of divorced or unmarried daughters with their 
own young children to the parental household (McLanahan 1988). In this 
instance as well, the main purpose generally has not been for the daughter to 
assist her aging mother but rather to receive help in housing and child care. 

In contemporary American society one is accustomed to thinking of most 
transitions to family roles and work careers as individual moves. This may 
differ among certain ethnic and cultural groups, where strong patterns of kin 
assistance are still extant. In the past, on the other hand, the timing of individual 
transitions had to be synchronized with familial ones. The family was the most 
critical agent in initiating and managing the timing of life transitions. Control 
over the timing of individual members' transitions was a crucial factor in the 
family's efforts to manage its resources, especially to balance different mem- 
bers' contributions to the family economy. The absence of a narrow, age-re- 
lated timing of transitions to adult life allowed for a more intensive interaction 
among different age groups within family and community, thus providing a 
greater sense of continuity and interdependence among people and among 
generations at various points in the life course. 

In a historical context, early life transitions were bound up with later ones 
in a continuum of familial needs and obligations. Hence the life transitions of 
the younger generation were intertwined with those of the older generation. 
Specifically, the timing of children's leaving home, getting married, and setting 
up a separate household was contingent on the timing of parents' transitions 
into retirement, inheritance, or widowhood (Greven 1970, Hareven 1982). This 
interdependence dictated parental control over the timing of adult children's 
life transitions. The strategies that parents and children followed in determining 
exchanges and supports in relation to the timing of life transitions represent, 
therefore, important theoretical and empirical issues that require further explo- 
ration. 


INTERDEPENDENCE AMONG KIN 


Contrary to prevailing myths and sociological theories, urbanization and in- 
dustrialization did not break down traditional ties and reciprocal relations 
among kin (Anderson 1971, Hareven 1978c, Sussman 1959). Whether they 
resided separately or in the same household, for members of the nuclear family 
interdependence with extended kin was at the base of survival. Kin served as 
the most essential resource for economic assistance and security and shoul- 
dered the major burden of welfare functions for individual family members. 
Historical scholarship has documented the survival of kinship ties and viable
functions of kin following the process of industrialization in the nineteenth 
century. During the nineteenth century and the early part of this century, kin 
fulfilled a central role in facilitating migration, in locating jobs and housing, 
and in providing assistance during critical life situations. Kin assistance was 
pervasive in urban neighborhoods and extended back to the communities of 
origin of immigrants and migrants through various exchanges. Immigrants in 
the United States often sent back remittances for their aging parents and other 
relatives in their home communities, while relatives remaining behind took 
care of aging parents and family farms (Anderson 1971, Hareven 1982). 

Under the historical conditions in which familial assistance was the almost 
exclusive source of security, the multiplicity of obligations that individuals 
incurred over life toward their kin was more complex than in contemporary 
society. In addition to the ties they retained with their family of origin, indi- 
viduals carried numerous obligations toward their family of procreation and 
toward their spouses' family of origin. Such obligations cast men and women 
into various overlapping and, at times, conflicting roles over the course of their 
lives. The absence of institutional supports in the form of welfare agencies, 
unemployment compensation, and social security added to the pressures im- 
posed on the kin group. In the regime of economic insecurity characteristic of 
the nineteenth century and the first part of this century, kin assistance was the 
only constant source of support. Family coping, by necessity, dictated that 
individual choices be subordinated to collective familial considerations and 
needs. Supports from kin were crucial during critical life situations such as 
unemployment, illness or death, as well as for normative life transitions (An- 
derson 1971, Hareven 1982). 

This strong interdependence among kin meant that individual choices had 
to be subordinated to collective family needs. Individuals' sense of obligation 
to their kin was dictated by their family culture. It expressed a commitment 
to the survival, well-being, and self-reliance of the family, which took priority 
over individual needs and preferences. Autonomy of the family, essential for 
self-respect and good standing in the neighborhood and community, was one 
of the most deeply ingrained values (Hareven 1982). 

Mutual assistance among kin, although involving extensive exchanges, was 
not strictly calculative. Rather, it expressed an overall principle of reciprocity 
over the life course and across generations. Individuals who subordinated their 
own careers and needs to those of the family as a collective unit did so out of 
a sense of responsibility, affection, and familial obligation, rather than with 
the expectation of immediate gain. Such sacrifices were not made without 
protest, however, and at times involved competition and conflict among sib- 
lings as to who should carry the main responsibility of support for aging 
parents. Close contact and mutual exchanges among parents, their adult children
and other kin persisted throughout the nineteenth century and survived
into the twentieth century in various forms in the lives of working-class and 
ethnic families. Parents expected their grown children to support them in their 
old age in exchange for supports they had rendered their children earlier in 
life. Societal values rooted in ethnic cultures provided ideological reinforce- 
ments for these reciprocal obligations (Hareven 1982, Eggebeen & Hogan 
1990). 

Changes in the configurations of kin resulting from the increase in divorce 
rates and remarriage during the past three decades may have important im- 
plications for the available kinship pools in the later years of life, as well as 
for generational boundaries within a kinship group. As Furstenberg (1981) 
stresses, divorce has not necessarily led to the depletion of existing kinship 
ties. Especially where the older generation is involved with child-rearing, ties 
with grandparents survive divorce. When divorce is followed by remarriage, 
the kinship pool expands through the addition of new relatives, without relin- 
quishing the existing ones. This "remarriage chain" as Furstenberg refers to 
it, links family and present conjugal partners and their relatives, primarily 
through the child. Since the older generation is likely to invest in child-rearing, 
Furstenberg concludes that children reared within these complex configura- 
tions will maintain stronger obligations toward future supports for the older 
generation (1981:137). 

Building on Furstenberg's identification of complex kinship networks in 
contemporary society, Riley & Riley further expand the definition to include
non-kin, who were members of the network as a result of cohabitation and
other associations with members of the nuclear family: “Indeed, the emerging 
boundaries of the kin network may be more closely influenced by gender, or 
even by race and ethnicity, than by age or generation. Instead, the boundaries 
of the kin network have been widened to encompass many diverse relation- 
ships, including several degrees of stepkin and in-laws, single parent families, 
adopted and other "relatives" chosen from outside the family, and many oth- 
ers...." Riley & Riley characterize these complex kinship and fictitious kinship 
relations as “a latent web of continually shifting linkages that provide the 
potential for activating and intensifying close kin relationships as they are 
needed" (Riley & Riley 1994, p. 6). 

This phenomenon is not entirely new. In the nineteenth century and in the 
first part of this century, various ethnic groups and working class families were 
enmeshed in similar networks consisting of fictitious kin as well as kin. Some 
of these networks stretched across wide geographic regions, linking migrants 
or immigrants in the communities of settlement with their kin in the commu- 
nities of origin into one social system. Some of these ties were latent or 
activated in accordance with various needs (Hareven 1982). The significance 
of these new networks is that they transcend the boundaries of age and generations,
which may enhance their flexibility and efficacy. When comparing
the contemporary networks to the historic ones, however, one needs to assess 
their long-term effectiveness. The historic networks were also flexible and 
fluid, but the expectations for supports were clearly defined by long-standing 
norms of reciprocity. For the contemporary networks one would need extensive 
research to identify the rules and principles by which they operate and to follow 
them longitudinally in order to determine their durability. 


GENERATIONAL SUPPORTS OVER THE LIFE COURSE 


Despite the strong tradition of kin assistance, spouses and children have been 
the main caregivers for aging parents. Both historical and contemporary studies 
of supports for older people have identified adult children, most commonly 
daughters, as the main caregivers, where spouses were not available. Even in 
time periods and among ethnic groups where individuals were deeply embed- 
ded in reciprocal relations with extended kin, the main responsibility of car- 
egiving, particularly for frail, elderly parents, was that of children. Kin pro- 
vided sociability and occasional help, but the day-to-day involvement with 
caregiving fell upon the children. Regardless of how many children a couple 
had, one child usually emerged in the role of caregiver (Smith 1979, Chudacoff 
& Hareven 1979). 

Assistance among the generations stretched across the life course and tended 
to be mutual, informal, and recurrent under normal circumstances, as well as 
during critical life situations. Adult children's involvement with the care of 
their aging parents was closely related to their earlier life course experiences, 
to their respective ethnic and cultural traditions, and to the historical context 
affecting their lives. Routine assistance from children to aging parents prepared 
the children to cope with parents' later life crises, especially widowhood and 
dependence in old age. 

When both parents survived into old age and were able to cope on their 
own, the children were more likely to try to maintain them in the parental 
home. Children made an effort to have their parents reside nearby, preferably 
in the same building or the same block, but not in the same household if it 
could be avoided. In cases of illness or need, children visited their parents on 
a daily basis, arranged for medical treatment, provided bodily care, prepared 
meals, and ran errands. After the death of one parent, children temporarily 
took in the surviving parent. In some cases, several siblings contributed jointly 
by hiring a nurse to take daily care of a frail parent who was still living at 
home (Hareven & Adams 1994, published). Children, most commonly daugh- 
ters, took a parent into their own household under circumstances of extreme 
duress, such as when parents were too frail to live alone or when they needed 
extensive help with their daily activities and regular care. There was no prescribed
rule as to which child would become a “parent keeper." If the child
was not already residing with the parent, the selection of a child for that role 
was governed by a particular child's ability and willingness to take the parent 
in, by the consent or support of the parent keeper’s spouse, and by the readiness 
of the parent to accept the plan (Hareven & Adams 1994). 

Life course antecedents were crucial determinants of whether an individual 
was cast in the role of "parent keeper." Some children took on this responsi- 
bility because of a sudden family crisis, but most evolved into this role over 
their life course. Most commonly, the parent keeper was the child who con- 
tinued to reside with a parent after the other siblings had left home. Even when 
both parents were alive, as noted above, the youngest daughter was expected 
to remain at home and postpone or give up marriage in order to ensure support 
for the parents in their old age. This pattern was pervasive among various 
ethnic groups until World War II (Hareven 1982). Caretaking daughters gave 
up marriage altogether or sometimes waited for decades until their parents died
before they could marry. They fulfilled their role at a high price to themselves 
and to their spouses and other family members. As various gerontological 
studies have emphasized, caregiving disrupted a daughter's work career, led 
to crowding in her household, caused tension and strain in her marriage, and 
made her vulnerable in preparing for her own and her spouse's retirement and 
old age (Hareven & Adams 1994, Brody 1990, Cantor 1983). 

More recent studies have questioned what may be an excessive emphasis 
on “women in the middle" as the main caregivers for aging parents. Several 
of these studies have explored the respective roles of females and males as 
caregivers and have concluded that while men performed managerial and 
maintenance tasks, as well as providing financial and social supports, the 
women predominantly performed the daily hands-on caregiving (Dwyer & 
Coward 1991, Kaye & Applegate 1990, Dwyer & Seccombe 1991, Hareven 
1993). Historically, it was the women who maintained the life-long role of 
“kin keepers.” Although needs and responsibilities changed, the kin keepers’ 
centrality to the kin networks as helpers, arbiters and pacifiers continued and 
became even more pivotal with age. It was the kin keeper who became the 
primary caregiver and who marshalled other relatives and negotiated their 
subsidiary roles in the effort to provide care for frail elderly parents (Hareven 
1982). 


COHORT LOCATION IN HISTORICAL TIME 


Historical changes in generational supports and in attitudes toward receiving 
and providing such supports are best reconstructed through a comparison of 
cohorts who encountered different historical circumstances. In their study of 
two cohorts of the adult children of immigrants to the industrial community 
of Manchester, New Hampshire, Hareven & Adams found significant differences
in the attitudes and practices of the two cohorts in relation to their earlier
historical experiences. An earlier cohort (born 1910-1919), who came of age 
during the Great Depression, and a more recent cohort (born 1920-1929), who 
came of age during World War II, were both the children of a historic cohort 
who had migrated to Manchester to work in the textile industry. Members of 
the earlier cohort, toughened by the Depression, were primarily concerned with 
keeping the entire family afloat economically: They pooled resources, doubled 
up on housing, and supported their aging parents and other needy relatives. 
They held to the traditional ideologies of relying on kin rather than on public 
agencies (Hareven 1982). 

By contrast, members of the more recent cohorts took advantage of the 
economic recovery brought about by World War II and were eager to develop 
middle class life styles. They devoted themselves to improving their own lives 
and their children's future, and they were more prepared to accept government 
help or nursing homes for their parents. They were more likely to live sepa- 
rately and thus drew firmer boundaries between younger and older generations. 
Neither the earlier nor the later children's cohort, however, was free of the 
complexities involved in handling the problems of generational assistance. 

While the earlier children's cohort had a more clearly defined commitment 
to collective family values and kin assistance, its members, who had actually 
gone through the experience of caring for elderly parents in their own home 
or had sacrificed their own marriage for parental care, did not do so without 
ambivalence, doubt, bitterness, or the threat of a lonely old age for themselves. 
Their posture had often been one of resignation to familial norms and accep- 
tance of "fate" rather than free choice. Members of the later cohort, on the 
other hand, who followed a more individualistic course, were not free of guilt 
over the way in which the support of their aging parents had been worked out. 

Both cohorts were, to some degree, transitional between a milieu of deep 
involvement in generational assistance, reinforced by strong familial and ethnic 
values, and the individualistic values and lifestyles that emerged in the post-World
War II period. In this historical process, the earlier cohort's lives
conformed more closely to the script of their traditional familial and ethnic 
cultures, while the later cohort, as it Americanized, was being pulled in the 
direction of individualistic middle-class values. The transition was by no means 
completed. Members of the later cohort had not entirely freed themselves of 
their traditional upbringing. Both cohorts still expressed their parents' values, 
but the later cohort felt less able or inclined to implement them (Hareven &
Adams 1994). 

This comparison of the cohorts reveals differences in the practice of care- 
taking and in the attitudes toward the responsibilities of caring for aging 
parents. Both cohorts were transitional in the sense that they were still strongly 
bound by their parents’ values and expectations that children should serve as
the major caretakers. Both cohorts attempted to fulfill this script at a high price 
to their own relationship to their spouses, to their ability to help their grown 
children, and to their preparation for their own "old age." The earlier of the 
two children's cohorts was more inclined to live by their parents’ cultural 
script, despite the fact that its members were more vulnerable as their own 
“old age" approached. The later of the children's cohorts was more ambivalent 
and more conflicted about a commitment to care for a frail or chronically ill 
parent. They were especially hesitant about taking such a parent into their own 
home. 

The parent cohort had been the major supporters of their aging parents. They 
viewed kin as their exclusive source of assistance over the life course. For that 
very reason, they also expected their main support in old age to come from 
their children. They tried to remain self-sufficient as long as possible, however. 
Paradoxically, they viewed all supports from their children as part of the 
family's self-reliance and ranked assistance from their adult children as their 
highest priority, followed by assistance from extended and more distant kin. 
Given their commitment to self-reliance within the family, they viewed public 
welfare as a last resort. 

While the parents expected their children to assist them in old age, the 
children did not expect (want) to have to rely on their own children for 
economic support. They prepared for their later years through pension plans, 
savings and home ownership, and expected to rely on social security, and, if 
needed, on assistance from the welfare state. In cases of illness or disability, 
they hoped to be in a nursing home. The most they expected from their children 
was emotional support and sociability. This attitude was also a result of these 
cohorts’ becoming accustomed to assistance from public agencies and to 
interacting with bureaucratic institutions over the life course (Hareven & 
Adams 1994). 

The difference between the two children's cohorts and the parents thus 
reflects the historical process of an increasing individualization in family 
relationships, and a reliance on public agencies and bureaucratic institutions 
to shoulder the responsibilities for the care of dependent elderly. The historical 
process is well known, but the detailed analysis of the interviews of the cohorts' 
members provides first-hand testimony about how this change was perceived 
and experienced by the women and men who were caught up in it. The process 
is one of increasing separation between the family of origin and the family of 
procreation over the past century, combined with a privatization of family life 
and the erosion of mutual assistance among kin. 

These historical changes have tended to escalate insecurity and isolation as 
people age, most markedly in areas of need that are not met by public welfare 
programs. Although some of the intensive historical patterns of kin interaction 
have survived among first-generation immigrant, black, and working-class
families, a gradual weakening of mutual assistance among kin over time has 
occurred (Jackson et al 1988). Sokolovsky and others have warned against a 
romanticization of generational interdependence among more recent immigrant
groups, such as Hispanics or Koreans, and among blacks (Sokolovsky 1990,
Mutran 1986, Burton & deVries 1992, Taylor & Chatters 1991, Dowd & 
Bengtson 1978, Burton & Dilworth-Anderson 1991). 

How consistent and continuous the support from nonresident children or 
other kin to aging relatives has been in the United States is still widely open 
to future research. The studies in gerontology or sociology insisting on the 
persistence of kin supports for older people in contemporary society have not 
documented the intensity, quality, and consistency of these supports in meeting 
the needs of older people, especially of the frail and chronically ill elderly. 
Most of these studies have used visiting patterns and telephone communication 
as evidence, rather than regular caregiving and coresidence (Litwak 1965, 
Shanas 1979). 

Recent studies have provided more systematic evidence of various supports 
from adult children to aging parents, especially for the “old, old" in contemporary
society. Some of these supports involve coresidence; in other cases, the
caretaking child provides assistance in the parent's household (Brody et al 
1983, Brody 1990, Dwyer & Coward 1991). The contact that older people 
have with kin, as Shanas (1979) and others have found, might represent a form 
of behavior characteristic of specific cohorts rather than a persistent pattern. 
The cohorts who are currently aged, especially the “old, old," have carried 
over their historical attitudes and traditions advocating an almost exclusive 
reliance on kin. Historical precedents also reveal the high price that kin had 
to pay in order to assist each other without the appropriate public supports 
(Hareven 1978b, 1982). 

Except for members of certain ethnic groups, future cohorts, as they reach 
old age, might not have the same strong sense of familial interdependence 
characteristic of earlier cohorts, nor might they have sufficiently large pools 
of kin on whom to rely. Rossi & Rossi (1990) have discovered that in the 
population sample they studied in Boston, the younger respondents expressed 
a stronger sense of normative obligations to kin than did the older ones. On 
this basis the authors concluded that obligations to kin have not been declining 
over historical time. This assertion would need to be tested, however, by 
comparing the attitudes of the group studied in Boston with their age counter- 
parts in the late nineteenth century. 

The major changes that have confounded the problems of older people in 
contemporary society were rooted not so much in changes in family structure 
or residential arrangements of the generations, as has generally been argued, 
but rather in the transformation and redefinition of family functions and of 
values governing family relations and generational assistance. Over the nineteenth
century, the family surrendered many of the functions previously concentrated
within it to other social institutions. A retreat from public life and a
growing commitment to the privacy of the family in the middle-class have led 
to the drawing of sharper boundaries between family and community and have 
intensified the segregation of age groups within and outside the family (Demos 
1970, Rothman 1971). 

The transfer of social-welfare functions from the family to public institutions 
over the past century and a half has not been fully consummated, however. 
The family has ceased to be the only available source of support for its 
dependent members, and the community has ceased to rely on the family as 
the major agency of welfare and social control. Who actually provides supports 
for the elderly and what form those supports take have been subject to ambi- 
guities. On the one hand, family members assume that the public sector carries 
the major responsibilities of care for the aged; on the other hand, the public 
sector assumes that the family is responsible for the major supports. This 
confusion in the assignment of responsibilities often means that old people are 
caught between the family and the public sector without receiving proper 
supports from either. 

The expectation that family and kin carry the major responsibilities for the 
care of aged relatives still prevails, without the provision of the necessary 
supports that would enable kin to discharge such responsibilities (Litwak 
1985). The decline in instrumental relations among kin and their replacement 
by an individualistic orientation toward family relations, with sentimentality 
and intimacy as the major cohesive forces, has led to the weakening of the 
role of kin assistance in middle-class families in particular, and to an increasing 
isolation of the elderly in American society (Hareven 1977b). 


CONCLUSION 


In addition to its obvious role of providing an understanding of change over 
time, a historical perspective serves two additional functions: It enables us to 
compare contemporary phenomena with similar ones in the past, in order to 
assess how new or different from their predecessors they really are; and, 
secondly, it offers models of coping from the past that may be modified and 
utilized in the present. 

In this article, a historical and life course perspective has helped identify 
the complexity of social change itself. Even though it is possible to trace some 
general trends—such as the emergence of age segregation, separation among 
the generations and individualization—a warning against following a linear 
path of change is in order. Rather than being linear, the process of social change 
has been uneven and multilayered. Historic forms of household extension and 
coresidence with relatives, which have disappeared over the twentieth century, 
have reappeared in different configurations. Complex kinship networks char- 
acteristic of earlier centuries, which resulted from remarriage following the 
death of a spouse, have reappeared in different forms in “blended families” 
following divorce. Recently the trend toward age uniformity and streamlined 
timing of life transitions has been modified or reversed by the return of an 
“erratic life course,” which is similar to the nineteenth-century one, but driven 
by different social forces. 

When examining changes over time in the family, the life course, and 
generational relations in American society, one needs to pay attention to 
differences in class and ethnicity. The most important dimension still absent 
from studies of long-term changes in the family and generational relations 
involves systematic differences among social classes and ethnic groups. We 
need a more detailed understanding of the historical process by which patterns 
of family behavior that first emerged in the middle class were transferred to 
other classes. When generalizing on long-term changes 
by comparing contemporary patterns to past ones, it is essential to specify what 
ethnic groups or social classes are being compared. 

Any examination of changes in generational relations and in the family over 
time needs to take into consideration the diverse nature of “the family” itself, 
rendered fluid by shifts in internal age and gender configurations across regions 
and over time. Since the family is the arena in which generational relations 
are acted out, one needs to achieve a clearer definition of “family” and 
“kinship.” It is important to understand the boundaries and the overlap between 
the two, especially as one examines the newer, complex kinship configurations 
that have emerged in contemporary society. 

As mentioned above, the historical experience can provide important models 
for coping in the present. Even though longevity, with its inherent social 
problems, is a phenomenon unique to our times, the more general aspects of 
providing generational assistance and supports, and coping with dependence 
in old age, can be enhanced by drawing on models of familial adaptation in 
the past. In doing so, one must not lose sight of the dramatic social, economic, 
and institutional differences in the contemporary context. For example, it 
would be beneficial to adapt past models of generational and kin assistance 
and surrogate familial arrangements for the support of older people. When 
doing so, it would be counterproductive, however, to idealize these patterns 
from the past and to expect kin to “take care of their own.” 